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OSTEOCLAST TRANSPORTER PROTCIN 



Government Support 

This work was funded in part by the NIDDK under Grant Number ROl DK45639. The 
5 United States government may have certain rights to this invention. 



Field of the Invention 

This invention relates to osteoclast transporter genes, proteins coded for by such genes, 
and diagnostics and therapeutics related to medical conditions associated with such genes and 
10 proteins, including osteoporosis and osteopetrosis. 



Background o f the Invention 

The remodeling of bone is a dynamic process. Cells continuously lay down and resorb 
bone material. An imbalance in the activity of cells that lay down new bone (osteoblasts) and 

15 cells that resorb bone (osteoclasts) can resuh in serious, and sometimes even fatal, disorders. 

Osteoporosis is a term used for a number of diseases of diverse etiology, all involving a 
reduction in the mass of bone per unit volume. Osteoporosis is the most common of the 
metabolic bone diseases. Twenty-five million people in the United States and more than two 
hundred million people worldwide are affected by osteoporosis. Osteoporosis is frequent among 

20 post-menopausal women and is an important cause of morbidity in the elderly. It commonly , 
results in bone fractures, and death can be a frequent occurance in the months following 
fractures, particularly those of the hip in elderly individuals. 

Osteopetrosis is a disorder involving an increase in the mass of bone per unit volume. Its 
incidence is rare compared to osteoporosis, but it typically is life threatening. Despite having 

25 multiple causes, a defect in bone resorption is always the underlying mechanism. In many 
instances, the disorder is inherited as an autosomal recessive trait and involves abnormal 
osteoclast function. Bone marrow transplants from normal donors have been attempted to 
restore normal osteoclast precursor cells, but this therapy has shown only limited success. 
Present treatments for osteoporosis and osteopetrosis are inadequate. 

30 There exists a need to influence favorably the bone remodeling process to treat 

osteoporosis and osteopetrosis. There also exists a need to identify the gene(s) responsible for 
osteopetrosis and to provide a genetic therapy for treating osteopetrosis. 
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An object of the invention is to provide compounds that desirably influence the bone 
remodeling process. 

Another object of the invention is to provide therapeutics for treating osteoporosis. 
Another object of the invention is to provide therapeutics for treating osteopetrosis. 
5 Still another object of the invention is to provide diagnostics and research tools relating to 

osteoporosis and osteopetrosis. These and other objects will be described in greater detail below. 



Summary of t he Invention 

The invention involves in one respect the discovery of osteoclast transporter proteins and 

10 nucleic acid molecules encoding those proteins. The expression and biological activity of the 
proteins are necessary for normal osteoclast function, and alteration of the expression or 
biological activity of these proteins can be used to influence osteoclast activity and thereby affect 
bone remodeling. In addition, normal osteoclast function can be established in abnormal 
osteoclasts lacking a normal osteoclast transporter protein by supplying to the abnormal 

15 osteoclast a nucleic acid expressing a functional osteoclast transporter protein. 

The preferred nucleic acids of the invention are homologs and alleles of the nucleic acids 
of SEQ ID NO:l and SEQ ID NO:3. The invention further embraces functional equivalents, 
variants, analogs and fragments of the foregoing nucleic acids and also embraces proteins and 
peptides coded for by any of the foregoing. 

20 According to one aspect of the invention, an isolated nucleic acid molecule is provided. 

The molecule hybridizes under stringent conditions to a molecule consisting of the nucleic acid 
sequence of SEQ ID NO: 1 or SEQ ID NO:7 and it codes for an osteoclast transporter molecule. 
The invention further embraces nucleic acid molecules that differ from the foregoing isolated 
nucleic acid molecules in codon sequence due to the degeneracy of the genetic code. ITic 

25 invention also embraces complements of the foregoing nucleic acids. 

Preferred isolated nucleic acid molecules are those which hybridize under stringent 
conditions to a nucleic acid molecule consisting of the nucleotide sequence of SEQ ID NO:3, 
particularly those comprising the human cDNAs or gene corresponding to SEQ ID NO:3 and the 
cDNAs or gene corresponding to SEQ ID NO:l . 

30 The invention in another aspect involves expression vectors, and host cells transformed or 

transfccted with such expression vectors, comprising the nucleic acid molecules described above. 
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In one embodiment of the invention, the host cell is a hematopoietic osteoclast precursor cell, 
such as a stem cell. 

According to another aspect of the invention, an isolated nucleic acid molecule is 
provided which comprises a unique fragment of SEQ ID NO:l between 12 and 2101 nucleotides 
5 in length or a unique fragment of nucleotides 1-21 12 of SEQ ID NO:3 between 12 and 2111 
nucleotides in length, and complements thereof. In one embodiment, the unique fragment is at 
least 150 and more preferably at least 200 nucleotides in length. In another embodiment, the 
unique fragment is between 12 and 32 contiguous nucleotides in length. In some embodiments, 
the isolated nucleic acid molecule is a unique fragment of SEQ ID NO:l between 12 and 1974 

10 nucleotides in length or a unique fragment of nucleotides 120-1733 of SEQ ID NO:l between 12 
and 1612 nucleotides in length. In another embodiment, the isolated nucleic acid molecule is 
selected from the group consisting of a unique fragment of nucleotides 1-1 179 of SEQ ID NO:3 
between 12 and 1 178 nucleotides in length and a unique fragment of nucleotides 1512-21 12 of 
SEQ ID NO:3 between 12 and 599 nucleotides in length. In certain embodiments, the isolated 

15 nucleic acid molecule is a unique fragment of the coding region of the human osteoclast 

transporter gene. Such isolated nucleic acid molecule include unique fragments of nucleotides 
73-1758 of SEQ ID NO:3 between 12 and 1684 nucleotides in length. Preferably, unique 
fragments of the coding region include isolated nucleic acid molecules selected from the group 
consisting of unique fragments of nucleotides 73-1 179 of SEQ ID NO:3 between 12 and 1 105 

20 nucleotides in length and unique fragments of nucleotides 1512-1 758 of SEQ ID NO:3 between 
12 and 245 nucleotides in length. 

According to another aspect of the invention, isolated polypeptides coded for by the 
isolated nucleic acid molecules described above also are provided as well as functional 
equivalents, variants, analogs and fragments thereof. In one embodiment, the polypeptide is a 

25 human osteoclast transporter protein or a functionally active fragment or variants thereof. 

The invention also provides isolated polypeptides which selectively bind an osteoclast 
transporter protein or fragments thereof. Isolated binding polypeptides include antibodies and 
fragments of antibodies (e.g. Fab, F(ab)2, Fd and antibody fragments which include a CDR3 
region which binds selectively to the osteoclast transporter proteins of the invention). Preferred 

30 isolated binding polypeptides are those that bind an extracellular portion of the osteoclast 
transporter proteins of the invention. 
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The invention in another aspect involves a method for decreasing osteoclast activity in a 
subject. An agent that selectively binds to an isolated nucleic acid molecule as described above 
or an expression product thereof is administered to a subject in need of such treatment, in an 
amount effective to decrease osteoclast activity in the subject. In one embodiment, the agent 
5 selectively binds to an extracellular domain of the expression product. Preferred agents are 
modified antisense nucleic acids and polypeptides. 

The invention is a further aspect provides for the use of agents which bind to the 
foregoing nucleic acids and expression products thereof in the preparation of a medicament. 
Preferred agents are modified antisense nucleic acids and polypeptides. 
10 The invention also contemplates gene therapy for osteopetrosis, wherein defective stem 

cells of a donor are genetically engineered to include an isolated nucleic acid expressing a 
functional osteoclast transporter protein. The cells then are returned to the donor. 

In the same manner as described above kidney cell tubule activity may be modulated 
using the methods and products of the invention. 



15 



Brief Description of the Drawing 

FIG. 1 is a graph depicting hydrophobicity vs. amino acid for the mouse osteoclast 
transporter protein. 



20 Detailed Description of the Invention 

The present invention in one aspect involves the cloning of a gene encoding an osteoclast 
transporter protein. The sequence of the gene (from mouse) is presented as SEQ ID NO: 1 . and 
the predicted amino acid sequence of this gene's protein product is presented as SEQ ID NO:2. 
The mRNA transcript is about 2,0 kb in length. It was obtained from mouse kidney tubule cells. 

25 A partial clone of the apparent human homolog was identified in GenBank, accession no. 

H20345, and is presented as SEQ ID NO:7. The partial clone is available from American Type 
Culture Collection Depository. Rockville, Maryland. The human cDNA encoding the osteoclast 
transporter protein was isolated by a standard hybridization protocol using probes consisting of 
the 5* end of the mouse gene (nucleotides 45-357 of SEQ ID NO: 1 ) and the partial human gene 

30 sequence (SEQ ID NO:7). Tlie sequence of the full length human cDNA is presented as SEQ ID 
NO:3, and the predicted amino acid sequence of the human protein product is presented as SEQ 
ID NO:4. 
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The mouse gene maps to the proximal portion of chromosome 19, in a region to which 
the osteosclerosis (oc) mutation has been assigned previously. The phenotype of oc/oc mutant 
mice includes osteopetrosis, and the osteoclasts of these mutant mice, although present, appear to 
be nonfunctional. The osteoclast transporter protein of the present invention is not expressed in 
5 oc/oc mutant mice, although Southern analysis indicates that the transporter locus is 

unrearranged and intact in these mutant mice. The osteoclast transporter protein of the present 
invention, however, is expressed in normal osteoclasts, and it is believed that hereditziry 
osteopetrosis results from mutations to this gene, which impair osteoclast function. 

The gene is not expressed in any other normal tissue tested, except in kidney tubule cells 
10 where its function is believed to be redundant. In particular. Northern analysis demonstrated that 
the gene is not expressed in normal skeletal muscle, pancreas, lung, heart, brain, stomach, spleen, 
salivary gland, thymus, liver, large intestine or small intestine tissue. 

Searches of GenBank for similar related proteins shows that this protein shares some very 
limited, localized homology and sequence motifs to known proteins with transport functions. 
15 The protein of the invention, however, does have a length and tertiary structures similar to 
known transporters. 

Specifically, it appears to be a member of a large family of genes with transport functions 
called the "major facilitator superfamily" by Marger and Saier, which include a number of uni-, 
sym- and antiporters specific for sugars, organic acids and drugs. The common structural motif 

20 of this family are 12 transmembrane alpha helices, and often includes a central cytoplasmic loop 
between the 6th and 7th membrane-spanning domains. The hydrophobicity plot for the protein 
of the present invention is shown in Fig. 1. Each asterisk indicates a potential membrane 
spanning hydrophobic domain. There appears to be an extracellular loop between the first and 
second hydrophobic domains, an intra-cytoplasmic loop between the hydrophobic domains 6 and 

25 7, and a 3' intra-cytoplasmic tail. The 5' end also appears to be internal within the cytoplasm. 

The transporter also contains amino-acid motifs found in this class of molecules, 
including a sequence, D-R-F-G-R-K (SEQ ID NO:5), similar to a D-R/K-X-R-R/K sequence 
(SEQ ID NO:6) after the 2nd membrane domain, and a P-E-S/T sequence found after the 12th 
transmembrane domain. For genes with known fijnction, the transporter is most closely related 

30 to the polyspecific rat cation transporter Octl (30% amino acid identity and 50% similarity), 
which has been shown to transport a variety of organic cationic drugs (Grundcmann et al.. 
Nature. Vol. 372, December 1994.) 
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The invention thus involves osteoclast transport proteins, genes encoding those proteins* 
functional modifications and variants of the foregoing, useful fragments of the foregoing, as well 

as therapeutics and diagnostics relating thereto. 

Homologs and alleles of the osteoclast transporter genes of the invention can be identified 
5 by conventional techniques. Thus, an aspect of the invention is those nucleic acid sequences 
which code for osteoclast transporter proteins and which hybridize to a nucleic acid molecule 
consisting of SEQ ID NO:l or SEQ ID NO:3, under stringent conditions. The term "stringent 
conditions'' as used herein refers to parameters with which the art is familiar. More specifically, 
stringent conditions, as used herein, refers to hybridization at GS^'C in hybridization buffer (3.5 x 
10 SSC, 0.02% FicolU 0.02% polyvinyl pyrolidonc, 0.02% Bovine Serum Albumin, 2.5mM 

NaH,P04(pH7), 0.5% SDS, 2mM EDTA). SSC is 0.1 5M sodium chloride/0.1 5M sodium citrate, 
ph7; SDS is sodium dodecyl sulphate; and EDTA is ethylcncdiaminetetracetic acid. After 
hybridization, the membrane upon which the DNA is transferred is washed at 2 x SSC at room 
temperature and then at 0.1 x SSC/0.1 x SDS at 65°C. 
15 There are other conditions, reagents, and so forth which can used, which result in a 

similar degree of stringency . The skilled artisan will be familiar with such conditions, and thus 
they are not given here. It will be understood, however, that the skilled artisan will be able to 
manipulate the conditions in a manner to permit the clear identification of homologs and alleles 
of osteoclast transporter proteins of the invention. The skilled artisan also is familiar with the 
20 methodology for screening cells and libraries for expression of such molecules which then are 
routinely isolated, followed by isolation of the pertinent nucleic acid molecule and sequencing. 

In general homologs and alleles typically will share at least 40% nucleotide identity 
and/or at least 50% amino acid identity to SEQ ID NOs: 1 or 3 and SEQ ID NOs:2 or 4, 
respectively, in some instances will share at least 50% nucleotide identity and/or at least 65% 
25 amino acid identity and in still other instances will share at least 60% nucleotide identity and/or 
at least 75% amino acid identity. Watson-Crick complements of the forgoing nucleic acids also 
are embraced by the invention. 

In screening for osteoclast transporter protein family members, a Southern blot may be 
performed using the foregoing conditions, together with a radioactive probe. After washing the 
30 membrane to which the DNA is finally transferred, the membrane can be placed against x-ray 
film to detect the radioactive signal. 
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The invention also includes degenerate nucleic acids which include alternative codons to 
those present in the native materials. For example, serine residues are encoded by the codons 
TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the purposes of 
encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the art that any of 
5 the serine-encoding nucleotide triplets may be employed to direct the protein synthesis apparatus, 
in vitro or in v/vo, to incorporate a serine residue. Similarly, nucleotide sequence triplets which 
encode other amino acid residues include, but are not limited to: CCA, CCC, CCG and CCT 
(proline codons); CGA, CGC, CGG, CGT, AGA and AGG (arginine codons); ACA, ACC, ACG 
and ACT (threonine codons); AAC and AAT (asparaginc codons); and ATA, ATC and ATT 
10 (isoleucine codons). Other amino acid residues may be encoded similarly by multiple nucleotide 
sequences. Thus, the invention embraces degenerate nucleic acids that differ from the 
biologically isolated nucleic acids in codon sequence due to the degeneracy of the genetic code. 

The invention also provides isolated unique fragments of SEQ ID NO: 1 , SEQ ID NO:3, 
or compliments of SEQ ID NO: 1 or SEQ ID NO:3. A unique fragment is one that is a 
15 'signature' for the larger nucleic acid. It, for example, is long enough to assure that its precise 
sequence is not found in molecules outside of the osteoclast transporter protein family as defmed 
by claim 1 . Unique fragments can be used as probes in Southern blot assays to identify family 
members or can be used in amplification assays such as those employing PCR. As known to 
those skilled in the art, large probes such as 200 nucleotides or more are preferred for certain 
20 uses such as Southern blots, while smaller fragments will be preferred for uses such as PCR. 
Unique fragments also can be used to produce fusion proteins for generating antibodies or for 
generating immunoassay components. Likewise, unique fragments can be employed to produce 
fragments of the osteoclast transporter protein such as only the extracellular portion, useful, for 
example, in immunoassays or as a competitive inhibitor of the substrate of the osteoclast 
25 transporter protein in therapeutic applications. Unique fragments further can be used as antisense 
molecules to inhibit the expression of the osteoclast transporter proteins of the invention, 
particularly for therapeutic purposes as described in greater detail below. 

As will be recognized by those skilled in the art, the size of the unique fragment will 
depend upon its conservancy in the genetic code. Thus, some regions of SEQ ID NO:l and SEQ 
30 ID NO:3, and their complements, will require longer segments to be unique while others will 
require only short segments, typically between 12 and 32 nucleotides (e.g. 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 and 32 bases long). Virtually any segment of 
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SEQ ID NO: I or SEQ ID NO:3, or their complements, that is 1 8 or more nucleotides in length 
will be unique. Those skilled in the art arc well versed in methods for selecting such sequences, 
typically on the basis of the ability of the unique fragment to selectively distinguish the sequence 
of interest from non-family members. A comparison of the sequence of the fragment to those on 

5 known data bases typically is all that is necessary, although in vitro confirmatory hybridization 
and sequencing analysis may be performed. 

As mentioned above, the invention embraces antiscnse oligonucleotides that selectively 
bind to a nucleic acid molecule encoding an osteoclast transporter protein, to decrease osteoclast 
activity. This is desirable in virtually any medical condition wherein a reduction in osteoclast 

10 activity is desirable, including when a reduction in bone loss or an increase in bone mass is 
desired. By decreasing the osteoclast activity in a subject, bone remodeling thus can be 
favorably affected. Antisense molecules, in this manner, can be used to slow down or arrest the 
loss in bone mass occurring with certain forms of osteoporosis and may even result in an 
increase in bone mass in circumstances where an increase in bone mass is desirable. 

15 As used herein, the term "antisense oligonucleotide" or ' antisense" describes an 

oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonuclcotide which hybridizes under physiological 
conditions to DNA comprising a particular gene or to an mRNA transcript of that gene and, 
thereby, inhibits the transcription of that gene and/or the translation of that mRNA. The 

20 antisense molecules are designed so as to interfere with transcription or translation of a target 
gene upon hybridization with the target gene. Those skilled in the art will recognize that the 
exact length of the antisense oligonucleotide and its degree of complementarity with its target 
will depend upon the specific target selected, including the sequence of the target and the 
particular bases which comprise that sequence. It is preferred that the antisense oligonucleotide 

25 be constructed and arranged so as to bind selectively with the target under physiological 

conditions, i.e., to hybridize substantially more to the target sequence than to any other sequence 
in the target cell imder physiological conditions. Based upon SEQ ID NO: 1 and SEQ ID NO:3, 
or upon allelic or homologous genomic and/or cDNA sequences, one of skill in the art can easily 
choose and synthesize any of a number of appropriate antisense molecules for use in accordance 

30 with the present invention. In order to be sufficiently selective and potent for inhibition, such 
antisense oligonucleotides should comprise at least 10 and, more preferably, at least 15 
consecutive bases which are complementary to the target. Most preferably, the antisense 
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oligonucleotidcs comprise a complementary sequence of 20-30 bases. Although 
oligonucleotides may be chosen which are antisense to any region of the gene or mRNA 
transcripts, in preferred embodiments the antisense oligonucleotides correspond to N-terminal or 
5' upstream sites such as translation initiation, transcription initiation or promoter sites. In 
5 addition, 3 '-untranslated regions or telomerase sites may be targeted. Targeting to mRNA 

splicing sites has also been used in the art but may be less preferred if alternative mRNA splicing 
occurs. In addition, the antisense is targeted, preferably, to sites in which mRNA secondary 
structure is not expected (sec, e.g., Sainio et al.. Cell Mol. Neurobiol. 14(5):439-457 (1994)) and 
at which proteins are not expected to bind. Finally, although, SEQ ID NOs: 1 and 3 disclose 

10 cDNA sequences, one of ordinary skill in the art may easily derive the genomic DNAs 

corresponding to the cDNAs of SEQ ID NOs:l and 3. Thus, the present invention also provides 
for antisense oligonucleotides which are complementary to the genomic DNAs corresponding to 
SEQ ID NOs: 1 and 3. Similarly, antisense to allelic or homologous cDNAs and genomic DNAs 
are enabled without undue experimentation. 

15 In one set of embodiments, the antisense oligonucleotides of the invention may be 

composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof. That 
is, the 5* end of one native nucleotide and the 3' end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester intemucleoside linkage. These 
oligonucleotides may be prepared by art recognized methods which may be carried out manually 

20 or by an automated synthesizer. 

In preferred embodiments, however, the antisense oligonucleotides of the invention also 
may include "modified" oligonucleotides. That is, the oligonucleotides may be modified in a 
number of ways which do not prevent them from hybridizing to their target but which enhance 
their stability or targeting or which otherwise enhance their therapeutic effectiveness. 

25 The term "modified oligonucleotide" as used herein describes an oligonucleotide in 

which (1) at least two of its nucleotides are covalently linked via a synthetic intemucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5' end of one nucleotide 
and the 3' end of another nucleotide) and/or (2) a chemical group not normally associated with 
nucleic acids has been covalently attached to the oligonucleotide. Preferred synthetic 

30 intemucleoside linkages include phosphorothioates, alkylphosphonates, phosphorodithioates, 
phosphate esters, alkx lphosphonothioates, phosphoramidates, carbamates, carbonates, phosphate 
triesters, acetamidates. and carboxymethyl esters. 
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The term "modified oligonucleotide" also encompasses oligonucleotides with a 
covalently modified base and/or sugar. For example, modified oligonucleotides include 
oligonucleotides having backbone sugars which are covalently attached to low molecular weight 
organic groups other than a hydroxy! group at the 3' position and other than a phosphate group at 

5 the 5* position. Thus modified oligonucleotides may include a 2 -O-alkylated ribose group. In 
addition, modified oligonucleotides may include sugars such as arabinose instead of ribose. TTie 
present invention, thus, contemplates pharmaceutical preparations containing modified antisense 
molecules that arc complementary to and hybridizable with, under physiological conditions, 
nucleic acids encoding osteoclast transporter proteins, together with pharmaceutically acceptable 

1 0 carriers. 

Antisense oligonucleotides may be administered as part of a pharmaceutical composition 
or medicament. Such a pharmaceutical composition or medicament may include the antisense 
oligonucleotides in combination with any standard physiologically and/or pharmaceutically 
acceptable carriers which are known in the art. The compositions should be sterile and contain a 
15 therapeutically effective amount of the antisense oligonucleotides in a unit of weight or volume 
suitable for administration to a patient. The term "pharmaceutically acceptable" means a non- 
toxic material that does not interfere with the effectiveness of the biological activity of the active 
ingredients. The term "physiologically acceptable" refers to a non-toxic material that is 
compatible with a biological system such as a cell, cell culture, tissue, or organism. The 
20 characteristics of the carrier will depend on the route of administration. Physiologically and 
pharmaceutically acceptable carriers include diluents, fillers, salts, buffers, stabilizers, 
solubilizers, and other materials which are well known in the art. 

The invention also involves expression vectors coding for osteoclast transporter proteins 
and fragments and variants thereof and host cells containing those expression vectors. Virtually 
25 any cells, prokaryotic or eukaryotic, which can be transformed with heterologous DNA or RNA 
and which can be grown or maintained in culture, may be used in the practice of the invention. 
Examples include bacterial cells such as E. coli and mammalian cells such as mouse, hamster, 
pig, goat, primate, etc. They may be of a wide variety of tissue types, including mast cells, 
fibroblasts, oocytes and lymphocytes, and they may be primary cells or cell lines. Specific 
30 examples include CHO cells and COS cells. Cell-free transcription systems also may be used in 
lieu of cells. In gene therapy applications, human hematopoietic cells that are precursors of 
osteoclasts are contemplated. 
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As used herein, a "vector" may be any of a number of nucleic acids into which a desired 
sequence may be inserted by restriction and ligation for transport between different genetic 
environments or for expression in a host cell. Vectors are typically composed of DNA although 
RNA vectors are also available. Vectors include, but are not limited to, plasmids and phagemids. 
5 A cloning vector is one which is able to replicate in a host cell, and which is further characterized 
by one or more endonuclease restriction sites at which the vector may be cut in a determinable 
fashion and into which a desired DNA sequence may be ligated such that the new recombinant 
vector retains its ability to replicate in the host cell. In the case of plasmids, replication of the 
desired sequence may occur many times as the plasmid increases in copy number within the host 
10 bacterium or just a single time per host before the host reproduces by mitosis. In the case of 
phage, replication may occur actively during a lytic phase or passively during a lysogenic phase. 
An expression vector is one into which a desired DNA sequence may be inserted by restriction 
and ligation such that it is operably joined to regulatory sequences and may be expressed as an 
RNA transcript. Vectors may further contain one or more marker sequences suitable for use in 
15 the identification of cells which have or have not been transformed or transfected with the vector. 
Markers include, for example, genes encoding proteins which increase or decrease either 
resistance or sensitivity to antibiotics or other compounds, genes which encode enzymes whose 
activities are detectable by standard assays known in the art (e.g. B-galactosidase or alkaline 
phosphatase), and genes which visibly affect the phenotype of transformed or transfected cells, 
20 hots, colonies or plaques. Preferred vectors are those capable of autonomous replication and 
expression of the structural gene products present in the DNA segments to which they are 
operably joined. 

As used herein, a coding sequence and regulatory sequences are said to be "operably" 
joined when they are covalently linked In such a way as to place the expression or transcription 

25 of the coding sequence under the influence or control of the regulatory sequences. If it is desired 
that the coding sequences be translated into a functional protein, two DNA sequences are said to 
be operably joined if induction of a promoter in the 5' regulatory sequences results in the 
transcription of the coding sequence and if the nature of the linkage between the two DNA 
sequences does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the 

30 ability of the promoter region to direct the transcription of the coding sequences, or (3) interfere 
with the ability of the corresponding RNA transcript to be translated into a protein. Thus, a 
promoter region would be operably joined to a coding sequence if the promoter region were 
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capable of effecting transcription of that DNA sequence such that the resulting transcript might 
be translated into the desired protein or polypeptide. 

The precise nature of the regulatory sequences needed for gene expression may vary 
between species or cell types, but shall in general include, as necessary, 5* non- transcribing and 
5 5* non-translating sequences involved with the initiation of transcription and translation 

respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. Especially, 
such 5* non-transcribing regulatory sequences will include a promoter region which includes a 
promoter sequence for transcriptional control of the operably joined gene. Regulatory sequences 
may also include enhancer sequences or upstream activator sequences as desired. The vectors of 
10 the invention may optionally include 5' leader or signal sequences, 5' or 3*. The choice and design 
of an appropriate vector is within the ability and discretion of one of ordinary skill in the art. 

Expression vectors containing all the necessary elements for expression are commercially 
available and known to those skilled in the art. See Sanbrook et al., MQ^PCujgtr Cloning; A 
Laboratorv Manual . Second Edition, Cold Spring Harbor Laboratory Press, 1989. Cells are 
15 genetically engineered by the introduction into the cells of heterologous DNA (RNA) encoding 
the osteoclast transporter protein or fragment or variant thereof. That heterologous DNA (RNA) 
is placed under operable control of transcriptional elements to permit the expression of the 
heterologous DNA in the host cell. In still another aspect of the invention, a defective osteoclast 
or precursor thereof is treated with DNA in a manner to promote via homologous recombination 
20 intracellularly the correction of a defective osteoclast transporter gene. 

Preferred systems for mRNA expression in mammalian cells are those such as pRc/CMV 
(available from Invitrogcn, San Diego, CA) that contain a selectable marker such as a gene thai 
confers G41 8 resistance (which facilitates the selection of stably transfccted cell lines) and the 
human cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, suitable for 
25 expression in primate or canine cell lines is the pCEP4 vector (Invitrogen), which contains an 
Epstein Barr virus (EBV) origin of replication, facilitating the maintenance of plasmid as a 
multicopy cxtrachromosomal element. 

mRNA expression and transporter function can be tested using these vectors in a wide 
variety of mammalian cell lines. Preferred systems include cells derived from kidney tubules, 
30 including MDCK and IMCD-3 cells (both available from ATCC). 

A variety of systems for expression of proteins in bacterial, yeast, mammalian, or insect 
cells have been described and are commercially available. Preferred systems include the 
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Glutathione-S-transferase (GST) Gene Fusion system available from Pharmacia Biotech, 
Piscataway, NJ. In this system a plasmid is constructed containing the protein sequence of 
interest (in this case, the transporter including the first extracellular domain) inserted in frame 
downstream of the 25 kDa GST domain from S. japonicum . Expression of the fusion protein can 
5 be induced in transfected bacterial cells and the fusion protein purified by affinity 

chromatography using Glutathione Sepharose 4B. Cleavage of the desired peptide from the GST 
sequences is achieved using a site specific protease whose recognition sequence is located 
immediately upstream from the cloning site. An alternative system which is desirable since it 
maintains eucaryotic-specific functions such as glycosylation is recombination into baculovirus. 
10 Standard protocols exist (c.f. O'Reilly et al., Baculovirus Expression Vectors: A :Laboratory 
Manual, IRL/Oxford University Press, 1992) and vectors, cells, and reagents are commercially 
available. 

The invention also involves polypeptides which bind to osteoclast transporter proteins 
and in certain embodiments preferably to the extracellular domain of the proteins. Such binding 

15 partners can be used in screening assays to detect the presence or absence of the osteoclast 

transporter protein and in purification protocols to isolate osteoclast transporter proteins. Such 
binding partners also can be used to inhibit the native activity of the osteoclast transporter protein 
by binding to the extracellular domain of such proteins. Likewise, such binding partners can be 
used to selectively target drugs, toxins or other molecules to osteoclasts. In this manner, 

20 osteoclast activity may be selectively increased by drugs that would enhance osteoclast activity 
or selectively impaired by dnigs that would inhibit osteoclast activity (e.g. toxins, antisense, 
etc.). In this manner, bone remodeling may be desirably affected. 

The invention, therefore, involves antibodies or fragments of antibodies having the ability 
to selectively bind to osteoclast transporter proteins, and preferably to the extracellular domain 

25 thereof Antibodies include polyclonal and monoclonal antibodies, prepared according to 

conventional methodology. Antibodies were raised according to standard procedures. A fusion 
protein Was prepared using the pGEX-4T-2 vector (Pharmacia, Piscataway, NJ). The fusion 
protein contains a glutathione-S-transferase (GST) fragment joined to the terminal 84 amino 
acids of the osteoclast transporter. The fusion protein was expressed and injected into rabbits to 

30 produce antibodies as rabbit immune serum. The antibodies so generated detected a protein of 
molecular weight which corresponds to the osteoclast transporter in tissue prepared from normal 
mouse kidneys. The protein is significantly reduced or absent in kidney tissue prepared from 
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odoc mutant mice, consistent with the absence of detectable osteoclast transporter mRNA by 

Northern blot in these mice. 

Significantly, as is well-known in the art, only a small portion of an antibody molecule, 
the paratope, is involved in the binding of the antibody to its epitope (see, in general, Clark, 

5 W,R. (1 986) The Experimental Foundations of M odern Immunology Wiley & Sons, Inc., New 
York; Roiu, L (1991) F.<i5;ential Immunology, 7th Ed., Blackwell Scientific Publications, 
Oxford). The pFc* and Fc regions, for example, are effectors of the complement cascade but arc 
not involved in antigen binding. An antibody from which the pFc' region has been enzymatically 
cleaved, or which has been produced without the pFc* region, designated an F(ab*)2 fragment, 

10 retains both of the antigen binding sites of an intact antibody. Similarly, an antibody from which 
the Fc region has been enzymatically cleaved, or which has been produced without the Fc region, 
designated an Fab fragment, retains one of the antigen binding sites of an intact antibody 
molecule. Proceeding further, Fab fragments consist of a covalently bound antibody light chain 
and a portion of the antibody heavy chain denoted Fd. The Fd fragments are the major 

1 5 determinant of antibody specificity (a single I-d fragment may be associated with up to ten 
different light chains without altering antibody specificity) and Fd fragments retain epitope- 

binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there are 
complementarity determining regions (CDRs), which directly interact with the epitope of the 
20 antigen, and framework regions (FRs), which maintain the tertiary structure of the paratope (see, 
in general. Clark, 1986; RoitU 1991). In both the heavy chain Fd fragment and the light chain of 
IgG immunoglobulins, there are four firamework regions (FRl through FR4) separated 
respectively by three complementarity determining regions (CDRl through CDR3). The CDRs, 
and in particular the CDR3 regions, and more particularly the heavy chain CDR3, are largely 

25 responsible for antibody specificity. 

It is now well-established in the art that the non-CDR regions of a mammalian antibody 
may be replaced with similar regions of conspecific or heterospecific antibodies while retaining 
the epitopic specificity of the original antibody. This is most clearly manifested in the 
development and use of "humanized" antibodies in which non-human CDRs are covalently 

30 joined to human FR and/or Fc/pFc' regions to produce a functional antibody. Thus, for example, 
PCT International Publication Number WO 92/0438 1 teaches the production and use of 
humanized murine RSV antibodies in which at least a portion of the murine FR regions have 
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1 

been replaced by FR regions of human origin. Such antibodies, including fragments of intact 
antibodies with antigen-binding ability, are often referred to as "chimeric" antibodies. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 
provides for F(ab')2, Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or FR 
5 and/or CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous 
human or non-human sequences; chimeric F(ab*)2 fragment antibodies in which the FR and/or 
CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous human 
or non-human sequences; chimeric Fab fragment antibodies in which the FR and/or CDRl and/or 
CDR2 and/or light chain CDR3 regions have been replaced by homologous human or non- 
10 human sequences; and chimeric Fd fragment antibodies in which the FR and/or CDRl and/or 
CDR2 regions have been replaced by homologous human or non-human sequences. The present 
invention also includes so-called single chain antibodies. Thus, the invention involves 
polypeptides of numerous size and type that bind specifically to osteoclast transporter proteins. 
These polypeptides may be derived also from sources other than antibody technology. For 
15 example, such polypeptide binding agents can be provided by degenerate peptide libraries which 
can be readily prepared in solution, in immobilized form or as phage display libraries. 
Combinatorial libraries also can be synthesized of peptides containing one or more amino acids. 
Libraries further can be synthesized of peptoids and non-peptide synthetic moieties. 

Phage display can be particularly effective in identifying binding peptides useful 

20 according to the invention. Briefly, one prepares a phage library (using e.g. ml 3, fd, or lambda 
phage), displaying inserts from 4 to about 80 amino acid residues using conventional procedures. 
The inserts may represent a completely degenerate or biased array. One then can select phage- 
bearing inserts which bind to the osteoclast transporter protein. This process can be repeated 
through several cycles of reselection of phage that bind to the osteoclast transporter protein. 

25 Repeated rounds lead to enrichment of phage bearing particular sequences. DNA sequence 

analysis can be conducted to identify the sequences of the expressed polypeptides. The minimal 
linear portion of the sequence that binds to the osteoclast transporter protein can be determined. 
One can repeat the procedure using a biased library containing inserts containing part or all of the 
minimal linear portion plus one or more additional degenerate residues upstream or downstream 

30 thereof. Thus, the osteoclast transporter molecule of the invention, an extracellular domain 
thereof, or the like, can be used to screen peptide libraries, including phage display libraries, to 
identify and select peptide binding partners of the extracellular portion of the o$teocIast 
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transporter protein of the invention. Such molecules can be used, as described, for screening 
assays, for diagnostic assays, for purification protocols, for interfering directly with the 
functioning of osteoclasts by binding to the osteoclast transporter protein or for targeting drugs, 
toxins and/or labeling agents (e.g. radioisotopes, fluorescent molecules, etc.) to osteoclasts 
5 and/or osteoclast transporter proteins. Drug molecules that would affect osteoclast activity and 
toxin molecules that would disable or destroy osteoclast cells arc known to those skilled in the 
art and arc commercially available. For example, the immunotoxin art provides examples of 
toxins which are effective when delivered to a cell by an antibody or fragment thereof. 
Examples of toxins include ribosome-damaging toxins derived from plants or bacterial such as 
10 ricin, abrin, saporin, Pseudomonas endotoxin, diphtheria toxin, A chain toxins, blocked ricin, etc. 

Osteoclast activity can be assayed both in vitro and in vivo. Heterogeneous culture 
systems in which bone marrow derived cells are co-cultured with osteoblast cells in the presence 
of 1,25-dihydroxy vitamin D3 will generate mature osteoclasts, whose activity can be quantitated 
by measuring their ability to form resorption pits on dentine slices (Takahashi et ai., 
15 Endocrinology^ Vol. 122, No. 4, 1988). In mammals, decreased osteoclast function results in the 
clinical condition of osteopetrosis. In rodents, this can be readily scored by virtue of the fact 
that, in this case, incisors fair to erupt after birth. ITius, rodent neonates can be treated with 
compositions of the invention and the effect on osteoclast activity can be determined by scoring 
incisor eruption. The degree of osteopetrosis can be more accurately quantitated by 
20 morphometrical analysis of the bone marrow space (Boyce et al., ./. Clin. Invest., Vol. 90, 1 992). 

When used therapeutically, the compounds of the invention are administered in 
therapeutically effective amounts. In general, a therapeutically effective amount means that 
amount necessary to delay the onset of, inhibit the progression of, or halt altogether the particular 
condition being treated. Therapeutically effective amounts specifically v^ll be those which 
25 desirably influence osteoclast activity. When it is desired to decrease osteoclast activity, then 
any inhibition of osteoclast activity is regarded as a therapeutically effective amount. When it is 
desired to increase osteoclast activity, then any enhancement of osteoclast activity is regarded as 
a therapeutically effective amount. Generally, a therapeutically cfleciivc amount will var>' with 
the subject's age, condition, and sex, as well as the nature and extent of the disease in the subject, 
30 all of which can be determined by one of ordinary skill in the art. The dosage may be adjusted 
by the individual physician or veterinarian, particularly in the event of any complication. A 
therapeutically effective amount typically varies from 0.01 mg/kg to about 1000 mg/kg. 
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preferably from about 0.1 mg/kg to about 200 mg/kg and most preferably from about 0.2 mg//kg 
to about 20 mg/kg, in one or more dose administrations daily, for one or more days. 

The therapeutics of the invention can be administered by any conventional route, 
including injection or by gradual infusion over time. The administration may, for example, be 
oral, intravenous, intraperitoneal, intramuscular, intracavity, subcutaneous, or transdermal. 
When antibodies are used therapeutically, a preferred route of administration is by pulmonary 
aerosol. Techniques for preparing aerosol delivery systems containing antibodies are well 
known to those of skill in the art. Generally, such systems should utilize components which will 
not significantly impair the biological properties of the antibodies, such as the paratope binding 
capacity (see, for example, Sciarra and Cutie, "Aerosols," in Remington's Pharmaceutical 
Sciences . 18th edition, 1990, pp 1694-1712; incorporated by reference). Those of skill in the art 
can readily determine the various parameters and conditions for producing antibody aerosols 
without resort to undue experimentation. When using antisense preparations of the invention, 
slow intravenous administration is preferred. 

Preparations for parenteral administration include sterile aqueous or non-aqueous 
solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl 
oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, 
including saline and buffered media. Parenteral vehicles include sodium chloride solution. 
Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's or fixed oils. Intravenous 
vehicles include fluid and nutrient replenishers, electrolyte replenishers (such as those based on 
Ringer's dextrose), and the like. Preservatives and other additives may also be present such as, 
for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like. 

The invention also contemplates gene therapy. The procedure for performing ex vivo 
gene therapy is outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history of 
that patent, all of which are publicly available documents. In general, it involves introduction in 
vitro of a functional copy of a gene into a cell(s) of a subject which contains a defective copy of 
the gene, and returning the genetically engineered cell(s) to the subject The functional copy of 
the gene is under operable control of regulatory elements which permit expression of the gene in 
the genetically engineered cell(s). Numerous transfection and transduction techniques as well as 
appropriate expression vectors are well known to those of ordinary skill in the art, some of which 
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are described in PCX application WO95/00654. In viy0 gene therapy using vectors such as 
adenovirus also is contemplated according to the invention. 

Thus, defective osteoclasts or precursors thereof are provided with a non-defective 
nucleic acid encoding an osteoclast transporter protein. In particular, such gene therapy is 
5 appropriate for hereditary forms of osteopetrosis involving defective osteoclast transporter 
protein genes. For example, primary human blood cells which are precursors of osteoclasts can 
be obtained from the bone marrow of a subject who is a candidate for such gene therapy. 
Candidates can be identified by screening for abnormal osteoclast function that results from a 
defective osteoclast transporter protein. Then, such cells can be genetically engineered ex vivo 
10 with DNA (RNA) encoding a normal osteoclast transporter protein. The genetically engineered 
cells then arc returned to the patient. 

That the transporter can be used to correct the defect in the clinical conditions of 
osteopetrosis in which it has been inactivated is demonstrated using a mouse model system. 'ITie 
osteosclerosis (oc) mutant mouse fails to express the transponer and has osteopetrosis. The 
5 transporter is introduced into a vector such as pXTl (Stralegene, La Jolla, CA) that, when 
transfected into an appropriate packaging cell line, will generate a higher titer of replication- 
defective retrovitral particles carrying the transporter under the regulatory control of a thymidine 
kinase promoter. Hematopoietic progenitor cells from oc mutant mice are prepared and, using 
well-documented techniques of infection and selection, stem cells carrying this vector arc 
selected after coculture with the retrovirus. Re-population of lethally irradiated oc mutant mice 
with this transfected population is accomplished using documented protocols of hematopoietic 
transplantation, and the degree of amelioration of the osteopetrosis phenotype then is determined 
by radiography and morphometrical analysis. 

While the invention has been described with respect to certain embodiments, it should be 
appreciated that many modifications and changes may be made by those of ordinary skill in the 
art without departing from the spirit of the invention. It is intended that such modification, 
changes and equivalents fall within the scope of the following claims. 
A Sequence Listing is provided, followed by what is claimed: 



9742321A1 I > 



wo 97/42321 



- 19- 

SBQUENCE LISTING 



PCTAJS97/07856 



(1) GENERAL INFORMATION: 

5 (i) APPLICANT: 

(A) NAME: BRIGHAM AND WOMEN'S HOSPITAL, INC. 

(B) STREET: 75 FRANCIS STREET 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

10 (E) COUNTRY: UNITED STATES OF T^yERICA 

(F) POSTAL CODE: 02115 



(i) APPLICANT/ INVENTOR: 

(A) NAME: BEIER, DAVID R. 

15 (B) STREET: 39 ARLIl^STON ROAD 

(C) CITY: BROOKLINE 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 02146 

20 

(i) APPLICANT/ INVENTOR: 

(A) NAME: BRADY, KEVIN P. 

(B) STREET: 3N BENNET COURT 

(C) CITY: BOSTON 

25 (D) STATE: MASSACHUSETTS 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 02113 



30 



(ii) TITLE OF INVENTION: OSTEOCLAST TRANSPORTER 



(iii) NUMBER OF SEQUENCES: 7 
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(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: WOLF, GREENFIELD & SACKS, P.C. 

(B) STREET: 600 ATLANTIC AVENUE 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 02210 



(v) COMPUTER READABLE FORM: 
10 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 



15 (vi) CURRENT APPLICATION DATA 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



20 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/647,397 

(B) FILING DATE: 09-MAY-1996 



(viii) ATTORNEY/AGENT INFORMATION: 
25 (A) NAME: Gates, Edward R. 

(B) REGISTRATION NUMBER: 31,616 

(C) REFERENCE/DOCKET NUMBER: B0801/7048WO 



(ix) TELECOMMUNICATION INFORMATION 
30 (A) TELEPHONE: 617-720-3500 

(B) TELEFAX: 617-720-2441 
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(2) INFORMATION FOR SEQ ID 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(iv) 7^1 -SENSE: NO 



15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
20 (B) LOCATION: 120. .1733 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



TGOGAGOGTG CTACTACAGC AGCTGCTGAA CCTAGACAGG CACGGCAACT GCTGCATCCA 60 

25 

GCTCCAGCCC AACTGAATCC AGCTCCAACC ACCAGTnTG GTTCATCTTG CCTQGTGCC 119 



ATG ACC TTC TCC GAG ATT CTG GAC OCT GTT GGA AGO ATG GGC CCC TTC 167 
Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser Met Gly Pro Phe 
30 1 5 10 15 " 



GAG TAC CTG CAT GTG ACC TTG CTG GCC CTC CCA ATC CTC GGA ATA GCC 215 
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Gln Tyr Leu His Val Thr Leu Leu Ala Leu Pro lie Leu Gly lie Ala 

20 25 30 

AAC CAC AAC ITG CTA GAG ATC TTC ACA GCC ACC ACC CCT GAC CAC CAC 263 
5 Asn His Asn Leu Leu Gin lie Phe Thr Ala Thr Thr Pro Asp His His 

35 40 45 

TCT CGC CCG CCC CCC AAC GCC TCT CTA GAG CCC TOG GTA CTC CCC TTG 311 
Cys Arg Pro Pro Pro Asn Ala Ser Leu Glu Pro Trp Val Leu Pro Leu 
10 50 55 60 



GGC CCA AAC GGG AAG CCT GAG AAG TGT CTC CGC TTC GTG CAT CTG CCA 359 
Gly Pro TVsn Gly Lys Pro Glu Lys Cys Leu Arg Phe Val His Leu Pro 
65 70 75 80 

AAC GCC ACT CTT CCC AAT GAC ACC CAG GGG GCC ACC GAG CCA TGC TTG 407 
Asn Ala Ser Leu Pro Asn Asp Thr Gin Gly Ala Thr Glu Pro Cys Leu 

85 90 95 



20 GAT GGC TOG ATC TAC AAC AGC ACC AGA GAC ACC ATT GTG ACA GAG TGG 455 
Asp Gly Trp lie Tyr Asn Ser Thr Arg Asp Thr He Val Thr Glu Trp 

100 105 110 

GAC TTG GTA TGC GGC TCC PAC AAA CTG AAG GAG ATG GCA CAG TCA GTC 503 
25 Asp Leu Val Cys Gly Ser Asn Lys Leu Lys Glu Met Ala Gin Ser Val 

115 120 125 

TTC ATG GCA GGT ATA CTG GTT GGA GGA CCT GTG TTT GGA GAA CTG TCA 551 
Phe Met Ala Gly lie Leu Val Gly Gly Pro Val Phe Gly Glu Leu Ser 
30 130 135 140 

GAC AGG TIT GGC CGC AAG CCC ATC CTG ACC TGG AGC TAT CTC TTG CTG 599 
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Asp Arg Phe Gly Arg Lys Pro lie Leu Thr Trp Ser Tyr Leu Leu Leu 
145 150 155 160 

GCA GCC AGT GGC TCX: AGT GCT GCC TTC AGC CCX: AGC CTC ACT GTC TAT 647 
5 Ala Ala Ser Gly Ser Ser Ala Ala Phe Ser Pro Ser Leu Thr Val Tyr 

165 170 175 

ATG ATC TTC OGA TTC CTG TGT GGC TGC AGC ATC TCG GGC ATT TCP CTG 695 
Met lie Phe Arg Phe Leu Cys Gly Cys Ser lie Ser Gly lie Ser Leu 
10 180 185 190 



15 



AGC ACC ATT ATC TTG AAT GTG GAA TOG GTA CCC ACC TCC AOG COG GCC 743 
Ser Thr lie lie Leu Asn Val Glu Trp Val Pro Thr Ser Thr Arg Ala 
195 200 205 



ATC TCA TCA ACA ACT ATT GGG TAC TGC TAC ACC ATT GGT CAA TTC ATT 791 
lie Ser Ser Thr Thr lie Gly Tyr Cys Tyr Thr lie Gly Gin Phe lie 
210 215 220 



20 CTG CCT GGC CTG GCC TAT GCC GTT CCT CAG TGG CGC TGG CTA CAG TTG 



839 



Leu Pro Gly Leu Ala Tyr Ala Val Pro Gin Trp Arg Trp Leu Gin Leu 



225 



230 



235 



240 



TCC GTG TCP GCT GCC TTC TTC ATC TTC TCC TTG TTG TCC TGG TGG GTA 



887 



25 Ser Val Ser Ala Ma Phe Phe lie Phe Ser Leu Leu Ser Trp Trp Val 



245 



250 



255 



CCA GAG TCC ATA OGC TGG CTG GTT CTG TCT GGA AAA TTC TCA OGA GCT 



935 



Pro Glu Ser lie Arg Trp Leu Val Leu Ser Gly Lys Phe Ser Arg Ala 



30 



260 



265 



270 



CTG AAG ACA CTC CAA CGT CTG GCT ACC TTC AAC GGC AAG AAG GAG GAA 



983 
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Leu Lys Thr Leu Gin Arg Val Ala Thr Phe Asn Gly Lys Lys Glu Glu 
275 280 285 



GGG GAA 7^ CTC ACT GTG GAG GAG CTG AAG TTC AAC TTG GAG AAG GAC 1031 
5 Gly Glu Lys Leu Thr Val Glu Glu Leu Lys Phe Asn Leu Gin Lys Asp 
290 295 300 



ATC ACC TCA GCC AAG GTC AAA TAT GGC TTA TCT GAC TTG TTC CGA GTG 1079 
lie Thr Ser Ala Lys Val Lys Tyr Gly Leu Ser Asp Leu Phe Arg Val 
10 305 310 315 320 



TCC ATC CTG CGC CGT GTG ACC TTC TGT CTC TCT CTG GCC TOG TTT GCT 1127 
Ser lie Leu Arg Arg Val Thr Phe Cys Leu Ser Leu Ala Trp Phe Ala 

325 330 335 



ACT GGC TIT GCC TAC TAG AGT TTG GCT ATG GGA GTA G?A GAA TTT GGA 1175 
Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val Glu Glu Phe Gly 

340 345 350 



20 GTC AAC ATC TAC ATA CTC CAG ATC ATC TTC GGT GOG GTT GAC ATT CCC 1223 
Val Asn lie Tyr lie Leu Gin lie lie Phe Gly Gly Val Asp lie Pro 
355 360 365 



GCC MiG TTC ATC ACA ATC CTC TCC ATA AGT TAT CTG GGC CGG CGC ATC 1271 
25 Ala Lys Phe lie Thr lie Leu Ser lie Ser Tyr L6u Gly Arg Arg lie 
370 375 380 



ACT CAG GGC TTC CTC CTG ATC CTG GCA GGA GTG GCC ATC CTG GCC CTC 1319 
Thr Gin Gly Phe Leu Leu lie Leu Ala Gly Val Ala lie Leu Ala Leu 
30 385 390 395 400 



ATC TTT GTG TCT TCA GAA ATG CAG CTC TTG AGA ACA GCA CTG GCT GTA 1367 
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Ile Phe Val Ser Ser Glu Met Gin Leu Leu Arg Thr Ala Leu Ala Val 

405 410 415 



TTT GGG AAG GGA TGC CTG TCT GGC TCC TTC AGC TGC CTC TTC CTC TAG 1415 
5 Phe Gly Lys Gly Cys Leu Ser Gly Ser Phe Ser Cys Leu Phe Leu Tyr 

420 425 430 



ACA AGT GAG CTC TAG CCT ACA GTC CTC AGG CAA ACA GGT ATG GGT ATC 1463 
Thr Ser Glu Leu Tyr Pro Thr Val Leu Arg Gin Thr Gly Met Gly lie 
10 435 440 445 



AGT AAC ATA TGG GGT CGA GTG GGA AGT ATG ATA GCC CCA CTG GTG AAA 1511 
Ser Asn lie Trp Ala Arg Val Gly Ser Met lie Ala Pro Leu Val Lys 
450 455 460 



15 



ATC AOG GGA GAA CTG CAG CCC TTC ATC CCI AAT GTC ATC TTT TOG ACC 1559 
He Thr Gly Glu Leu Gin Pro Phe He Pro Asn Val lie Phe Trp Thr 
465 470 475 480 



20 ATG Ad CPA CTG GGA GGC AGT GCP GCC TTC TTT CTG CTT GAG ACC CTC 1607 
Met Thr Leu Leu Gly Gly Ser Ala Ala Phe Phe Leu Leu Glu Thr Leu 

485 490 495 



AAT CGG CCC TTA CCA GAA ACT ATC GAG GAC ATA CAA GAC TOG TAG CAG 1655 
25 Asn Arg Pro Leu Pro Glu Thr He Glu Asp He Gin Asp Trp Tyr Gin 

500 505 510 



CAA ACC AAG AAA ACA AAG CAG GAG CCA GAA GCA GAA AAG GCA TCC CAG 1703 
Gin Ihr Lys Lys Thr Lys Gin Glu Pro Glu Ala Glu Lys Ala Ser Gin 
30 515 520 525 



ACA ATC COG CTG AAG ACT GGT GGA CCC TT^GCTAAGAA CAACAGAATC 1750 
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Thr lie Pro Leu Lys Thr Gly Gly Pro 



530 



535 



CTCmCCTG CCrrcCAGAG ACTGATCCCA AGCAGGGCCC TTCCAAGGCT ATTOSAGCAC 1810 



CTTAGGGGTT GGGTQGAGCC CCAGCTGGCT CCATGCTCTC AGSACAT^AGA CTTCTGAGAG 1870 



TTCAGCAAAA GTGnTTACC TTCACCACCr CCACCX3TAGC CCACAACXXA GACCTGGCCT 1930 



10 GTTCACAGCC CTAGCCATAC TCACTCCTGC ACTCATCCTC CCTGCAACCC AGGCCCTGCC 1990 



ATTTTrcrCT ACCCrcnTG TATTGGCCAT TTCCrCGATT GTCCCACXTTC CAnTCCCTT 2050 



TCAGATTCCC TGGCAOTTCr AATGGTITCC TCTITACCCrC CCCCTCX5TGC CG 



2102 



15 



(2) INFORMATION FOR SEQ ID NO: 2: 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 537 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



25 



(ii) MOLECOLE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser Met Gly Pro Phe 



10 



15 



30 



Gin Tyrr Leu His Val Thr Leu Leu Ala Leu Pro lie Leu Gly lie 

20 25 30 
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Asn His Asn Lreu Leu Gin lie Phe Thr Ala Thr Thr Pro Asp His His 

35 40 45 



Cys Arg Pro Pro Pro Asn Ala Ser Leu Glu Pro Trp Val Leu Pro Leu 
50 55 60 



Gly Pro Asn Gly Lys Pro Glu Lys Cys Leu Arg Phe Val His Leu Pro 
65 70 75 80 



10 Asn Ala Ser Leu Pro Asn Asp Thr Gin Gly Ala Thr Glu Pro Cys Leu 

85 90 95 



Asp Gly Trp lie Tyr Asn Ser Thr Arg Asp Thr lie Val Thr Glu Trp 

100 105 110 

15 

Asp Leu Val Cys Gly Ser Asn Lys Leu Lys Glu Met Ala Gin Ser Val 
115 120 125 



Phe Met Ala Gly lie Leu Val Gly Gly Pro Val Phe Gly Glu Leu Ser 
20 130 135 140 



Asp Arg Phe Gly Arg Lys Pro lie Leu Thr Trp Ser Tyr Leu Leu Leu 
145 150 155 160 



25 Ala Ala Ser Gly Ser Ser Ala Ala Phe Ser Pro Ser Leu Thr Val Tyr 

165 170 175 



Met lie Phe Arg Phe Leu Cys Gly Cys Ser lie Ser Gly lie Ser Leu 

180 185 190 

30 

Ser Thr lie lie Leu Asn Val Glu Trp Val Pro Thr Ser Thr Arg Ala 
195 200 205 
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Ile Ser Ser Thr Thr lie Gly Tyr Cys Tyr Thr lie Gly Gin Phe lie 
210 215 220 



Lieu Pro Gly Leu Ala Tyr Ala Val Pro Gin Trp Arg Trp Leu Gin Leu 
5 225 230 235 240 



Ser Val Ser Ala Ala Phe Phe lie Phe Ser Leu Leu Ser Trp Trp Val 

245 250 255 



10 Pro Glu Ser lie Arg Trp Leu Val Leu Ser Gly Lys Phe Ser Arg Ala 

260 265 270 



Leu Lys Thr Leu Gin Arg Val Ala Thr Phe Asn Gly Lys Lys Glu Glu 
275 280 285 

15 

Gly Glu Lys Leu Thr Val Glu Glu Leu Lys Phe Asn Leu Gin Lys Asp 
290 295 300 



lie Thr Ser Ala Lys Val Lys Tyr Gly Leu Ser Asp Leu Phe Arg Val 
20 305 310 315 320 



Ser lie Leu Arg Arg Val Thr Phe Cys Leu Ser Leu Ala Trp Phe Ala 

325 330 335 



25 Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val Glu Glu Phe Gly 

340 345 350 



Val Asn lie Tyr lie Leu Gin lie lie Phe Gly Gly Val Asp lie Pro 
355 360 365 

30 

Ala Lys Phe lie Thr lie Leu Ser lie Ser Tyr Leu Gly Arg Arg lie 
370 375 380 
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Thr Gin Gly Phe Leu Leu lie Leu Ala Gly Val Ala lie Ijeu Ala Leu 
385 390 395 400 



lie Phe Val Ser Ser Glu Met Gin Leu Leu Arg Thr Ala Leu Ala Val 

405 410 415 



Phe Gly Lys Gly Cys Leu Ser Gly Ser Phe Ser Cys Leu Phe Leu Tyr 

420 425 430 



10 Thr Ser Glu Leu Tyr Pro Thr Val Leu Arg Gin Thr Gly Met Gly lie 

435 440 445 



Ser Asn lie Trp Ala Pjcg Val Gly Ser Met He Ala Pro Leu Val Lys 
450 455 460 

15 

He Thr Gly Glu Leu Gin Pro Phe He Pro Asn Val He Phe Trp Thr 
465 470 475 480 



Met Thr Leu Leu Gly Gly Ser Ala Ala Phe Phe Leu Leu Glu Thr Leu 
20 485 490 495 



Asn Arg Pro Leu Pro Glu Thr He Glu Asp He Gin Asp Trp Tyr Gin 

500 505 510 



25 Gin Thr Lys Lys Thr Lys Gin Glu Pro Glu Ala Glu Lys Ala Ser Gin 

515 520 525 



Thr He Pro Leu Lys Thr Gly Gly Pro 
530 535 

30 



(2) INFORMATICOSI FOR SBQ ID NO: 3: 



JNSDCX:tD: <WO 9742321A1 I > 
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(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 2135 base pairs 

(B) lYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOIHETICAL : NO 



(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hotro sapiens 

15 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 73.. 1758 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

GAATTCCGGT GATCACCAGC CCCATCGGAT CCAGACCCGG CCACCAGCTC TGGCTCGTCT 60 

TGCCCCAGTG CC ATG ACC TTC TOG GAG ATC CTG GAC OGT GTG QGA AGC 108 
25 Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser 

15 10 

ATG GGC CAT TTC CAG TTC CTG CAT GTA GCC ATA CTG GGC CTC COG ATC 156 
Met Gly His Phe Gin Phe Leu His Val Ala lie Leu Gly Leu Pro lie 
30 15 20 25 

CTC AAC ATG GCC AAC CAC AAC CTG CTG CAG ATC TTC ACA GCC GCC ACC 204 
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Leu Asn Met Ala Asn His Asn Leu Leu Gin lie Phe Thr Ala Ala Thr 
30 35 40 



CCr GTC CAC CAC TGT CGC CCX3 CCC CAC AAT GCC TCC ACA GGG CCT TOG 252 
5 Pro Val His His Cys Arg Pro Pro His Asn Ala Ser Thr Gly Pro Trp 
45 50 55 60 



GTG CTC CCC ATG GGC CCA AAT GGG AAG CCT GAG AGG TGC CTC CGT TTI 300 
Val Leu Pro Met Gly Pro Asn Gly Lys Pro Glu Arg Cys Leu Axg Phe 
10 65 70 75 



GTA CAT CCG CCC AAT GCC AGC CTG CCC AAT GAC ACC CAG AGG GCC ATG 348 
Val His Pro Pro Asn Ala Ser Leu Pro Asn Asp Thr Gin Arg Ala Met 

80 85 90 



GAG CCA TGC CTG GAT GGC TGG GTC TAG AAC AGC ACC AAG GAC TCC ATT 396 

Glu Pro Cys Leu Asp Gly Trp Val Tyr Asn Ser Thr Lys Asp Ser lie 

95 100 105 



20 GTG ACA GAG TOG GAC TTG GTG TGC AAC TCC AAC AAA CTG AAG GAG ATG 444 
Val Thr Glu Trp Asp Leu Val Cys Asn Ser Asn Lys Leu Lys Glu Met 
110 115 120 



GCC CAG TCP ATC TTC ATG GCA GGT ATA CTG ATP GGA GGG CTC GTG CTT 492 
25 Ala Gin Ser He Phe Met Ala Gly He Leu He Gly Gly Leu Val Leu 
125 130 135 140 



GGA GAC CTG TCP GAC AGG TTT GGC CGC AGG CCC ATC CTG ACC TGC AGC 540 
Gly Asp Leu Ser Asp Arg Phe Gly Arg Arg Pro He Leu Thr Cys Ser 
30 145 150 155 



TAC CTG CTG CTG GCA GCC AGC GGC TCC GGT GCA GCC TTC AGC CCC ACC 588 
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Tyr Leu Leu Leu Ala Ala Ser Gly Ser Gly Ala Ala Phe Ser Pro Thr 

160 165 170 



TTC CCC ATC TAG ATG GTC TTC OGC TTC CTG TGT QGG TTT GGC ATC TCA 
5 Phe Pro lie Tyr Met Val Phe Arg Phe Leu Cys Gly Phe Gly lie Ser 

175 180 185 



GGC ATT ACC CTG AGO ACC GTC ATC TTG AAT GTG GAA TGG GTIG CCT ACC 
Gly lie Thr Leu Ser Thr Val lie Leu Asn Val Glu Trp Val Pro Thr 
10 190 195 200 



CGG ATG CGG GCC ATC ATG TCG ACA GCA CTC GGG TAC TGC TAC ACC TIT 
Arg Met Arg Ala lie Met Ser Thr Ala Leu Gly Tyr Cys Tyr Thr Phe 
205 210 215 220 



GGC CAG TTC ATT CTG CCC GGC CTG GCC TAC GCC ATC CCC CAG TOG CGT 
Gly Gin Phe lie Leu Pro Gly Leu Ala Tyr Ala lie Pro Gin Trp Arg 

225 230 235 



20 TOG CTG CAG TTA ACT GTG TCC ATT CCC TTC TTC GTC TTC TTC CPA TCA 
Trp Leu Gin Leu Thr Val Ser lie Pro Phe Phe Val Phe Phe Leu Ser 

240 245 250 



TCC TOG TGG ACA CCA GAG TCC ATA OGC TOG TTG GTC TTG TCP GGA AAG 
25 Ser Trp Trp Thr Pro Glu Ser lie Arg Trp Leu Val Leu Ser Gly Lys 

255 260 265 



TCC TCG AAG GCC CTG AAG ATA CTC CGG CGG GTT GGC TGT CIT CAA TGG 
Ser Ser Lys Ala Leu Lys lie Leu Arg Arg Val Gly Cys Leu Gin Trp 
30 270 275 280 



CAA GAA GGA AGA AGG AGA AAG CTC N2C TTG GAA GAG CTC AAA CTC AAC 
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Gin Glu Gly Arg Arg Arg Lys Leu Ser Leu Glu Glu Leu Lys Leu Asn 
285 290 295 300 



CTG GAG AAG GAG ATC TCC TTG GCC AAG GCC AAT TAG ACC GCA AGT GAG 1020 
5 Leu Gin Lys Glu lie Ser Leu Ala Lys Ala Asn Tyr Thr Ala Ser Asp 

305 310 315 

CTG TTC CGG ATA CCC ATG CTG CGC CGC ATG ACC TTC TGT CTT TCC CTG 1068 
Leu Phe Arg lie Pro Met Leu Arg Arg Met Thr Phe Cys Leu Ser Leu 
10 320 325 330 

GCC TGG TIT GCT ACC GGT TTT GCC TAC TAT AGT TTG GCT ATG GGT GTG 1116 
Ala Trp Phe Ala Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val 
335 340 34& 

15 

GAA GAA TTT GGA GTC AAC CTC TAC ATC CTC CAG ATC ATC TTT GGT GGG 1164 
Glu Glu Phe Gly Val Asn Leu Tyr lie Leu Gin lie lie Phe Gly Gly 
350 355 360 

20 GTC GAT GTC CCA GCC A7^ TTC ATC ACC ATC CTC TCC TTA AGC TAC CTG 1212 
Val Asp Val Pro Ala Lys Phe lie Thr He Leu Ser Leu Ser Tyr Leu 
365 370 375 380 



QGC CGG CAT ACC ACT CAG GCC GCT GCC CTG CTC CTG GCA GGA GOG GCC 1260 
25 Gly Arg His Thr Thr Gin Ala Ala Ala Leu Leu Leu Ala Gly Gly Ala 

385 390 395 

ATC TTG GCT CTC ACC TIT GTG CCC TTG GAC TTG CAG ACC GTG AGG ACA 1308 
He Leu Ala Leu Thr Phe Val Pro Leu Asp Leu Gin Thr Val Arg Thr 
30 400 405 410 

GTA TTG GCT GTG TIT GGG AAG GGA TGC CTA TCC AGC TCC TTC AGC TGC 1356 
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Val Leu Ala Val Phe Gly Lys Gly Cys Leu Ser Ser Ser Phe Ser Cya 
415 420 425 



CTC TTC CTC TAG ACA AGT GAA ITA TAG GCC ACA GTG ATG AGG CAA ACA 1404 
5 Leu Phe Leu Tyr Thr Ser Glu Leu Tyr Pro Thr Val lie Arg Gin Thr 
430 435 440 



GGT ATG GGC GTA AGT AAC CTG TGG ACC CGC GTG QGA AGC ATG GTG TCC 1452 
Gly Met Gly Val Ser Asn Leu Trp Thr Arg Val Gly Ser Met Val Ser 
10 445 450 455 460 



CCG CTG GTG AAA ATC AGG GGT G?sJ3 GTA GAG CCC TTC ATC CCC AAT ATC 1500 
Pro Leu Val Lys lie Thr Gly Glu Val Gin Pro Phe lie Pro Asn lie 

465 470 475 



ATC TAG GGG ATC ACC GCC CTC CTC GGG GGC AGT GCT GCC CTC TTC CTG 1548 
lie Tyr Gly lie Thr Ala Leu Leu Gly Gly Ser Ala Ala Leu Phe Leu 

480 485 490 



20 CCr GAG ACC CTTG AAT CAG CCC TTG CCA GAG ACT ATC GAA GAC CTG GAA 1596 
Pro Glu Thr Leu Asn Gin Pro Leu Pro Glu Thr lie Glu Asp Leu Glu 
495 500 505 



AAC TGG TCC CTG CGG GCA AAG AAG CCA AAG CAG GAG CCA GAG GTG GAA 1644 
25 . Asn Trp Ser Leu Arg Ala Lys Lys Pro Lys Gin Glu Pro Glu Val Glu 
510 515 520 



AAG GCC TCC CAG AGG ATC CTG TAG AGC CTC AGG GAC CAG GCC TGG GCT 1692 
Lys Ala Ser Gin Arg lie Leu Tyr Ser Leu Thr Asp Gin Ala Trp Ala 
30 525 530 535 540 



CCA GCT GAG GAC AAC GGA ACC CCC TIT CCC TGC CCT CCA GAG ACT GAT 1740 
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Pro Ala Glu Asp Asn Gly Thr Pro Phe Pro Cys Pro Pro Glu Thr Asp 



545 



550 



555 



CCr AGC GAG GCA CCT TAGGAGTATA GGGAGGCCCC ATATAGGTCC ATCCTCCTAG 



1795 



Pro Ser Gin Ala Pro 



560 



]0 



15 



20 



25 



GATGAAGCCT TCTGAGAGCT TGGTGAT^GGT GTCTCCATCA CCACCACCAG AGCCTCCTGC 1855 

CCAGCCCTGG CCACTTCAAA GGTTCAGCCA TCCCTGCCCT TGriTCTTCCCTr GCAACCCAQG 1915 

CCCTGCCATT CTTCTGTCrA GCCCTTCCCC ACTGGCCACC TTCCCCCACT GTCCCX3GTCC 1975 

TCirCCCCTG AGGrrCCCCTG ATATCCCCTG GCTCAGTCCT AAC?^AGACTG AGTCTTAACA 2035 

AGATGAGAAG TCCTCCCCTT CTTGCCTCCX: ACACimCT TTGATOGGAG GTITCAATAA 2095 

ACAGOGATAA GAACTCTAAA AAAAAAAAAA ACCGGAATTC 2135 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHTVRACTERISTICS : 

(A) LENGTH: 561 amino acids 

(B) TYPE: amino acid 
(D) TOPOIjOGY: linear 

(ii) MOLECUIiE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser Met Gly His Phe 
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1 5 10 15 



Gin Phe Leu His Val Ala lie Leu Gly Leu Pro lie Leu Asn Met Ala 

20 25 30 



Asn His Asn Leu Leu Gin lie Phe Thr Ala Ala Thr Pro Val His His 

35 40 45 



Cys Arg Pro Pro His Asn Ala Ser Thr Gly Pro Trp Val Leu Pro Met 
10 50 55 60 



Gly Pro Asn Gly Lys Pro Glu Arg Cys Leu Arg Phe Val His Pro Pro 
65 70 75 80 



15 Asn Ala Ser Leu Pro Asn Asp Thr Gin Arg Ala Met Glu Pro Cys Leu 

85 90 95 



Asp Gly Trp Val Tyr Asn Ser Thr Lys Asp Ser lie Val Thr Glu Trp 

100 105 110 

20 

Asp Leu Val Cys Asn Ser Asn Lys Leu Lys Glu Met Ala Gin Ser lie 

115 120 125 



Phe Met Ala Gly lie Leu lie Gly Gly Leu Val Leu Gly Asp Leu Ser 
25 130 135 140 



Asp Arg Phe Gly Arg Arg Pro lie Leu Thr Cys Ser Tyr Leu Leu Leu 
145 150 155 160 



30 Ala Ala Ser Gly Ser Gly Ala Ala Phe Ser Pro Thr Phe Pro lie Tyr 

165 170 175 
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Met Val Phe Arg Phe Leu Cys Gly Phe Gly lie Ser Gly lie Thr Leu 

180 185 190 



Ser Thr Val lie Leu Asn Val Glu Trp Val Pro Thr Arg Met Arg Ala 
195 200 205 



lie Met Ser Thr Ala Leu Gly Tyr Cys Tyr Thr Phe Gly Gin Phe lie 
210 215 220 



10 Leu Pro Gly Leu Ala Tyr Ala lie Pro Gin Trp Arg Trp Leu Gin Leu 
225 230 235 240 



Thr Val Ser lie Pro Phe Phe Val Phe Phe Leu Ser Ser Trp Trp Thr 

245 250 255 

15 

Pro Glu Ser lie Arg Trp Leu Val Leu Ser Gly Lys Ser Ser Lys Ala 

260 265 270 



Leu Lys lie Leu Arg Arg Val Gly Cys Leu Gin Trp Gin Glu Gly Arg 
20 275 280 285 



Arg Arg Lys Leu Ser Leu Glu Glu Leu Lys Leu Asn Leu Gin Lys Glu 
290 295 300 



25 lie Ser Leu Ala Lys Ala Asn Tyr Thr Ala Ser Asp Leu Phe Arg lie 
305 310 315 320 



Pro Met Leu Arg Arg Met Thr Phe Cys Leu Ser Leu Ala Trp Phe Ala 

325 330 335 

30 

Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val Glu Glu Phe Gly 

340 345 350 
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Val Asn Leu Tyr lie Leu Gin lie lie Phe Gly Gly Val Asp Val Pro 

355 360 365 



Ala Lys Phe lie Thr lie Leu Ser Leu Ser Tyr Leu Gly Arg His Thr 
5 370 375 380 



Thr Gin Ala Ala Ala Leu Leu Leu Ala Gly Gly Ala lie Leu Ala Leu 

385 390 395 400 



10 Thr Phe Val Pro Leu Asp Leu Gin Thr Val Arg Thr Val Leu Ala Val 

405 410 415 



Phe Gly Lys Gly Cys Leu Ser Ser Ser Phe Ser Cys Leu Phe Leu Tyr 

420 425 430 

15 

Thr Ser Glu Leu Tyr Pro Thr Val lie Arg Gin Thr Gly Met Gly Val 
435 440 445 



Ser Asn Leu Trp Thr Arg Val Gly Ser Met Val Ser Pro Leu Val Lys 
20 450 455 460 



lie Thr Gly Glu Val Gin Pro Phe lie Pro Asn lie lie Tyr Gly lie 
465 470 475 480 



25 Thr Ala Leu Leu Gly Gly Ser Ala Ala Leu Phe Leu Pro Glu Thr Leu 

485 490 495 



Asn Gin Pro Leu Pro Glu Thr lie Glu Asp Leu Glu Asn Trp Ser Leu 

500 505 510 

30 

Arg Ala Lys Lys Pro Lys Gin Glu Pro Glu Val Glu Lys Ala Ser Gin 
515 520 525 
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Arg lie Leu Tyr Ser Leu Thr Asp Gin Ala Trp Ala Pro Ala Glu Asp 
530 535 540 

Asn Gly Thr Pro Phe Pro Cys Pro Pro Glu Thr Asp Pro Ser Gin Ala 
5 545 550 555 560 

Pro 

10 

(2) INFORMATION FOR SEQ ID lsD:5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECUIiE TYPE: protein 

20 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

25 (v) EEUOIEOT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Arg Phe Gly Arg Lys 
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(2) INFORMATION FOR SBQ ID NO: 6: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STE^ANDEDNESS : single 

(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
15 (iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "Xaa at position 2 is Arg or 
Lys" 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "Xaa at position 5 is Arg or 
Lys" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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Asp Xaa Xaa Arg Xaa 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 
10 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



15 



(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



T^TTCATCA CCATCCTCTC CTTAAGCTAC CTOGGCOQGC ATACCACTCA GGCCGCTOCC 60 
25 TGCTGCrtXSGC AGGAGGGGCC ATCTTOGCTC TCACCmTG CCCTTOGACr TCCAGACCCT 120 



GAGACAGTAT TQGCTGTGTr TGGGAAGGGA TGCCTATCCA GCrCCFICAG CKSCCrCITC 180 



CTCTACACAA CTGAATTATA CCCCAGAGTC ATCAGGCAAA CAGGTATCGG CGTAAGTAAC 240 

30 

CTGTGGACCC GCGTGGGAAG CATGGTGTCC CGCTGGTGAA AATCAOGGGT GAGGTACAGC 300 
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CCTTCATCCC CAATATCATC TACGGGAT 



328 



NSDOCID: <WO 9742321 A1_L> 
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CLAIMS 



1 . An isolated nucleic acid molecule 

(a) which hybridizes under stringent conditions to a molecule consisting of the nucleic 
acid sequence of SEQ ID NO:l or SEQ ID NO;7 and which codes for an osteoclast transporter 
molecule, 

(b) nucleic acid molecules that differ from the nucleic acid molecules of (a) in codon 
sequence due to the degeneracy of the genetic code, and 

(c) complements of (a) and (b). 



2. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid molecule 
hybridizes under stringent conditions to a molecule consisting of the nucleic acid sequence of 
SEQ ID NO:3. 



15 3. The isolated nucleic acid molecule of claim 1 , wherein the isolated nucleic acid molecule 
consists essentially of SEQ ID NO:3. 



4. The isolated nucleic acid molecule of claim 1 , wherein the isolated nucleic acid molecule 
consists essentially of SEQ ID NO: 1. 



5. An isolated nucleic acid molecule selected from the group consisting of (a) a unique 
fragment of SEQ ID NO:l between 12 and 2101 nucleotides in length, (b) a unique fragment of 
nucleotides 1-21 12 of SEQ ID NO:3 between 12 and 21 1 1 nucleotides in length, (c) 
complements of "(a)** and (d) complements of "(b)". 



6. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is a unique fragment of SEQ ID NO:l between 12 and 1974 nucleotides in length. 

7. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
30 is a unique fragment of nucleotides 120-1 733 of SEQ ID NO: I between 1 2 and 1 612 nucleotides 

in length. 
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8. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is selected from the group consisting of a unique fragment of nucleotides 1-1 179 of SEQ ID 
NO:3 between 12 and 1 178 nucleotides in length and a unique fragment of nucleotides 1512- 
2112 of SEQ ID NO:3 between 12 and 599 nucleotides in length. 

5 

9. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is a unique fragment of nucleotides 73-1758 of SEQ ID NO:3 between 12 and 1684 nucleotides 
in length. 

10 10. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is selected from the group consisting of a unique fragment of nucleotides 73-1179 of SEQ ID 
NO:3 between 12 and 1 105 nucleotides in length and a unique fragment of nucleotides 1512- 
1758 of SEQ ID NO:3 between 12 and 245 nucleotides in length. 

15 II. The isolated nucleic acid molecule of any of claims 5-10, wherein the isolated nucleic 
acid molecule is selected from the group consisting of a unique fragment of at least 14 
contiguous nucleotides, a unique fragment of at least 1 5 contiguous nucleotides, a unique 
fragment of at least 16 contiguous nucleotides, a unique fragment of at least 17 contiguous 
nucleotides, a unique fragment of at least 18 contiguous nucleotides, a unique fragment of at 

20 least 20 contiguous nucleotides and a unique fragment of at least 22 contiguous nucleotides. 

12. The isolated nucleic acid molecule of any of claims 5-10, wherein the isolated nucleic 
acid molecule consists of between 12 and 32 contiguous nucleotides. 

25 13. A host cell transformed or transfected with an expression vector comprising the isolated 
nucleic acid molecule of any of claims 1 , 2, 3 or 4 operably linked to a promoter. 

14. An isolated polypeptide coded for by the isolated nucleic acid molecule of any of claims 
1, 2, 3 or 4 . 

30 

15. An isolated polypeptide selectively binding a protein coded for by the isolated nucleic 
acid molecule of any of claims 1 , 2, 3 or 4 . 
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16. The isolated polypeptide of claim 15, wherein the isolated polypeptide is an Fab or F(ab) 
fragment of an antibody. 

17. Tlie isolated polypeptide of claim 15 wherein the isolated polypeptide is a fragment of an 
5 antibody, the fragment including a CDR3 region selective for the protein. 

18. The isolated polypeptide of claim 1 5, wherein the isolated polypeptide is a monoclonal 
antibody, 

10 19. The isolated polypeptide of claim 15, 16 or 17, wherein the isolated polypeptide 
selectively binds an extracellular portion of the protein. 

20. A method for decreasing osteoclast activity in a subject comprising 

administering to a subject in need of such treatment an agent that selectively binds to an 
15 isolated nucleic acid molecule of claim 1 or an expression product thereof, in an amount effective 
to decrease osteoclast activity in said subject. 

21. A method as claimed in claim 20, wherein the agent is a modified nucleic acid. 
20 22. A method as claimed in claim 20, wherein the agent is a polypeptide. 

23. The use of an agent that selectively binds to an isolated nucleic acid molecule of claim 1 , 
or an expression product thereof, in the preparation of a medicament. 

25 24. The use £is claimed in claim 23, wherein the agent is a modified nucleic acid. 
25. The use as claimed in claim 23, wherein the agent is a polypeptide. 
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OSTEOCLAST TRANSPORTER PROTEIN 

Government Support 

This work was funded in part by the NIDDK under Grant Number ROl DK45639. The 
5 United States government may have certain rights to this invention. 

Field of the Invention 

This invention relates to osteoclast transporter genes, proteins coded for by such genes, 
and diagnostics and therapeutics related to medical conditions associated with such genes and 
1 0 proteins, including osteoporosis and osteopetrosis. 

Background of the Invention 

The remodeling of bone is a dynamic process. Cells continuously lay down and resorb 
bone material. An imbalance in the activity of cells that lay down new bone (osteoblasts) and 

15 cells that resorb bone (osteoclasts) can result in serious, and sometimes even fatal, disorders. 

Osteoporosis is a term used for a number of diseases of diverse etiology, all involving a 
reduction in the mass of bone per unit volume. Osteoporosis is the most common of the 
metabolic bone diseases. Twenty-five million people in the United States and more than two 
hundred million people worldwide are affected by osteoporosis. Osteoporosis is frequent among 

20 post-menopausal women and is an important cause of morbidity in the elderly. It commonly, 
results in bone fractures, and death can be a frequent occurance in the months following 
fractures, particularly those of the hip in elderly individuals. 

Osteopetrosis is a disorder involving an increase in the mass of bone per imit volume. Its 
incidence is rare compared to osteoporosis, but it typically is life threatening. Despite having 

25 multiple causes, a defect in bone resorption is always the underlying mechanism. In many 
instances, the disorder is inherited as an autosomal recessive trait and involves abnormal 
osteoclast function. Bone marrow transplants from normal donors have been attempted to 
restore normal osteoclast precursor cells, but this therapy has shown only limited success. 
Present treatments for osteoporosis and osteopetrosis are inadequate. 

30 There exists a need to influence favorably the bone remodeling process to treat 

osteoporosis and osteopetrosis. There also exists a need to identify the gene(s) responsible for 
osteopetrosis and to provide a genetic therapy for treating osteopetrosis. 
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An object of the invention is to provide compounds that desirably influence the bone 
remodeling process. 

Another object of the invention is to provide therapeutics for treating osteoporosis. 
Another object of the invention is to provide therapeutics for treating osteopetrosis. 
Still another object of the invention is to provide diagnostics and research tools relating to 
osteoporosis and osteopetrosis. These and other objects will be described in greater detail below. 

Smnma rv of the Inventioi^ 

The invention involves in one respect the discovery of osteoclast transporter proteins and 
nucleic acid molecules encoding those proteins. The expression and biological activity of the 
proteins are necessary for normal osteoclast function, and alteration of the expression or 
biological activity of these proteins can be used to influence osteoclast activity and thereby affect 
bone remodeling. In addition, normal osteoclast function can be established in abnormal 
osteoclasts lacking a normal osteoclast transporter protein by supplying to the abnormal 
osteoclast a nucleic acid expressing a functional osteoclast transporter protein. 

The preferred nucleic acids of the invention are homologs and alleles of the nucleic acids 
of SEQ ID NO:l and SEQ ID NO:3. The invention further embraces functional equivalents, 
variants, analogs and fragments of the foregoing nucleic acids and also embraces proteins and 
peptides coded for by any of the foregoing. 

According to one aspect of the invention, an isolated nucleic acid molecule is provided. 
The molecule hybridizes imder stringent conditions to a molecule consisting of the nucleic acid 
sequence of SEQ ID NO: 1 or SEQ ID NO:7 and it codes for an osteoclast transporter molecule. 
The invention further embraces nucleic acid molecules that differ from the foregoing isolated 
nucleic acid molecules in codon sequence due to the degeneracy of the genetic code. The 
invention also embraces complements of the foregoing nucleic acids. 

Preferred isolated nucleic acid molecules are those which hybridize xmder stringent 
conditions to a nucleic acid molecule consisting of the nucleotide sequence of SEQ ID NO:3, 
particularly those comprising the human cDNAs or gene corresponding to SEQ ID NO:3 and the 
cDNAs or gene corresponding to SEQ ID NO:l. 

The invention in another aspect involves expression vectors, and host cells transformed or 
transfected with such expression vectors, comprising the nucleic acid molecules described above. 



9742321A1JB> 



i.' 



WO 97/42321 PCTAJS97/07856 

-3 - 

In one embodiment of the invention, the host cell is a hematopoietic osteoclast precursor cell, 
such as a stem cell. 

According to another aspect of the invention, an isolated nucleic acid molecule is 
provided which comprises a unique fragment of SEQ ID NO:l between 12 and 2101 nucleotides 
5 in length or a unique fragment of nucleotides 1-21 12 of SEQ ID NO:3 between 12 and 2111 
nucleotides in length, and complements thereof. In one embodiment, the unique fragment is at 
least 150 and more preferably at least 200 nucleotides in length. In another embodiment, the 
unique fragment is between 12 and 32 contiguous nucleotides in length. In some embodiments, 
the isolated nucleic acid molecule is a unique fragment of SEQ ID NO:l between 12 and 1974 

10 nucleotides in length or a unique fragment of nucleotides 120-1733 of SEQ ID NO:l between 12 
and 1612 nucleotides in length. In another embodiment, the isolated nucleic acid molecule is 
selected from the group consisting of a unique fragment of nucleotides 1-1 179 of SEQ ID NO:3 
between 12 and 1 178 nucleotides in length and a unique fragment of nucleotides 1512-2112 of 
SEQ ID NO:3 between 12 and 599 nucleotides in length. In certain embodiments, the isolated 

15 nucleic acid molecule is a unique fragment of the coding region of the human osteoclast 

transporter gene. Such isolated nucleic acid molecule include unique fragments of nucleotides 
73-1758 of SEQ ID NO:3 between 12 and 1684 nucleotides in length. Preferably, unique 
fragments of the coding region include isolated nucleic acid molecules selected from the group 
consisting of unique fragments of nucleotides 73-1 1 79 of SEQ ID NO:3 between 12 and 1 105 

20 nucleotides in length and unique fragments of nucleotides 1512-1758 of SEQ ID NO:3 between 
12 and 245 nucleotides in length. 

According to another aspect of the invention, isolated polypeptides coded for by the 
isolated nucleic acid molecules described above also are provided as well as functional 
equivalents, variants, analogs and fragments thereof. In one embodiment, the polypeptide is a 

25 hxmian osteoclast transporter protein or a functionally active fragment or variants thereof. 

The invention also provides isolated polypeptides which selectively bind an osteoclast 
transporter protein or fragments thereof. Isolated binding polypeptides include antibodies and 
fragments of antibodies (e.g. Fab, F(ab)2, Fd and antibody fragments which include a CDR3 
region which binds selectively to the osteoclast transporter proteins of the invention). Preferred 

30 isolated binding polypeptides are those that bind an extracellular portion of the osteoclast 
transporter proteins of the invention. 
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The invention in another aspect involves a method for decreasing osteoclast activity in a 
subject. An agent that selectively binds to an isolated nucleic acid molecule as described above 
or an expression product thereof is administered to a subject in need of such treatment, in an 
amount effective to decrease osteoclast activity in the subject. In one embodiment, the agent 
5 selectively binds to an extracellular domain of the expression product. Preferred agents are 
modified antisense nucleic acids and polypeptides. 

The invention is a further aspect provides for the use of agents which bind to the 
foregoing nucleic acids and expression products thereof in the preparation of a medicament. 
Preferred agents are modified antisense nucleic acids and polypeptides. 
10 The invention also contemplates gene therapy for osteopetrosis, wherein defective stem 

cells of a donor are genetically engineered to include an isolated nucleic acid expressing a 
functional osteoclast transporter protein. The cells then are returned to the donor. 

In the same manner as described above kidney cell tubule activity may be modulated 
using the methods and products of the invention. 



15 



Brief Description of the Drawing 
FIG. 1 is a graph depicting hydrophobicity vs. amino acid for the mouse osteoclast 
transporter protein. 



20 Detailed Description of the Invention 

The present invention in one aspect involves the cloning of a gene encoding an osteoclast 
transporter protein. The sequence of the gene (from mouse) is presented as SEQ ID NO: 1 . and 
the predicted amino acid sequence of this gene's protein product is presented as SEQ ID NO:2. 
The mRNA transcript is about 2.0 kb in length. It was obtained from mouse kidney tubule cells. 

4 

25 A partial clone of the apparent human homolog was identified in GenBank, accession no. 

H20345, and is presented as SEQ ID NO:7. The partial clone is available from American Type 
Culture Collection Depositor)'. Rockville, Maryland. The human cDNA encoding the osteoclast 
transporter protein was isolated by a standard hybridization protocol using probes consisting of 
the 5' end of the mouse gene (nucleotides 45-357 of SEQ ID NO: 1) and the partial hiunan gene 

30 sequence (SEQ ID NO:7). The sequence of the full length human cDNA is presented as SEQ ID 
NO:3, and the predicted amino acid sequence of the human protein product is presented as SEQ 
ID NO:4. 
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The mouse gene maps to the proximal portion of chromosome 19, in a region to which 
the osteosclerosis (oc) mutation has been assigned previously. The phenotype of oc/oc mutant 
mice includes osteopetrosis, and the osteoclasts of these mutant mice, although present, appear to 
be nonfunctional. The osteoclast transporter protein of the present invention is not expressed in 

0' 

5 oc/oc mutant mice, although Southern analysis indicates that the transporter locus is 

unrearranged and intact in these mutant mice. The osteoclast transporter protein of the present 
invention, however, is expressed in normal osteoclasts, and it is believed that hereditary 
osteopetrosis results from mutations to this gene, which impair osteoclast function. 

The gene is not expressed in any other normal tissue tested, except in kidney tubule cells 

10 where its function is believed to be redundant. In particular. Northern analysis demonstrated that 
the gene is not expressed in normal skeletal muscle, pancreas, lung, heart, brain, stomach, spleen, 
salivary gland, thymus, liver, large intestine or small intestine tissue. 

Searches of GenBank for similar related proteins shows that this protein shares some very 
limited, localized homology and sequence motifs to known proteins with transport functions. 

15 The protein of the invention, however, does have a length and tertiary structures similar to 
known transporters. 

Specifically, it appears to be a member of a large family of genes witli transport functions 
called the "major facilitator superfamily" by Marger and Saier, which include a number of xmi-, 
sym- and antiporters specific for sugars, organic acids and drugs. The common structural motif 

20 of this family are 12 transmembrane alpha helices, and often includes a central cytoplasmic loop 
between the 6th and 7th membrane-spanning domains. The hydrophobicity plot for the protein 
of the present invention is shown in Fig. 1. Each asterisk indicates a potential membrane 
spanning hydrophobic domain. There appears to be an extracellular loop between the first and 
second hydrophobic domains, an intra-cytoplasmic loop between the hydrophobic domains 6 and 

25 7, and a 3* intra-cytoplasmic tail: The 5* end also appears to be intemal within the cytoplasm. 

The transporter also contains amino-acid motifs found in this class of molecules, 
including a sequence, D-R-F-G-R-K (SEQ ID NO:5), similar to a D-R/K-X-R-R/K sequence 
(SEQ ID NO:6) after the 2nd membrane domain, and a P-E-S/T sequence found after the 12th 
transmembrane domain. For genes with known function, the transporter is most closely related 

30 to the polyspecific rat cation transporter OctI (30% amino acid identity and 50% similarity), 
which has been shown to transport a variety of organic cationic drugs (Grundemann et al.. 
Nature, Vol. 372, December 1 994.) 



BNSDOCID: <WO_9742321A1JB> 



wo 97/42321 PCTAJS97/07856 

-6- 

The invention thus involves osteoclast transport proteins, genes encoding those proteins, 
functional modifications and variants of the foregoing, useful fragments of the foregoing, as well 
as therapeutics and diagnostics relating thereto. 

Homologs and alleles of the osteoclast transporter genes of the invention can be identified 
5 by conventional techniques. Thus, an aspect of the invention is those nucleic acid sequences 
which code for osteoclast transporter proteins and which hybridize to a nucleic acid molecule 
consisting of SEQ ID NO:l or SEQ ID NO:3, under stringent conditions. The term "stringent 
conditions" as used herein refers to parameters with which the art is familiar. More specifically, 
stringent conditions, as used herein, refers to hybridization at 65°C in hybridization buffer (3.5 x 

10 SSC, 0.02% Ficoll, 0.02% polyvinyl pyrolidone, 0.02% Bovine Serum Albumin, 2.5mM 

NaH2P04(pH7), 0.5% SDS, 2mM EDTA). SSC is 0. 1 5M sodium chloride/0. 1 5M sodium citrate, 
ph7; SDS is sodium dodecyl sulphate; and EDTA is ethylenediaminetetracetic acid. After 
hybridization, the membrane upon which the DNA is transferred is washed at 2 x SSC at room 
temperature and then at 0.1 x SSC/0.1 x SDS at 65^C. 

15 There are other conditions, reagents, and so forth which can used, which result in a 

similar degree of stringency. The skilled artisan will be familiar with such conditions, and thus 
they are not given here. It will be understood, however, that the skilled artisan will be able to 
manipulate the conditions in a manner to permit the clear identification of homologs and alleles 
of osteoclast transporter proteins of the invention. The skilled artisan also is familiar with the 

20 methodology for screening cells and libraries for expression of such molecules which then are 
routinely isolated, followed by isolation of the pertinent nucleic acid molecule and sequencing. 

In general homologs and alleles typically will share at least 40% nucleotide identity 
and/or at least 50% amino acid identity to SEQ ID NOs:l or 3 and SEQ ID NOs:2 or 4, 
respectively, in some instances will share at least 50% nucleotide identity and/or at least 65% 

25 amino acid identity and in still other instances will share at least 60% nucleotide identity and/or 
at least 75% amino acid identity. Watson-Crick complements of the forgoing nucleic acids also 
are embraced by the invention. 

In screening for osteoclast transporter protein family members, a Southern blot may be 
performed using the foregoing conditions, together with a radioactive probe. After washing the 

30 membrane to which the DNA is finally transferred, the membrane can be placed against x-ray 
film to detect the radioactive signal. 
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The invention also includes degenerate nucleic acids which include alternative codons to 
those present in the native materials. For example, serine residues are encoded by the codons 
TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the purposes of 
encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the art that any of 
5 the serine-encoding nucleotide triplets may be employed to direct the protein synthesis apparatus, 
in vitro or in v/vo, to incorporate a serine residue. Similarly, nucleotide sequence triplets which 
encode other amino acid residues include, but are not limited to: CCA, CCC, CCG and CCT 
(proline codons); CGA, CGC, CGG, CGT, AGA and AGG (arginine codons); ACA, ACC, ACG 
and ACT (threonine codons); AAC and AAT (asparagine codons); and ATA, ATC and ATT 

10 (isoleucine codons). Other amino acid residues may be encoded similarly by multiple nucleotide 
sequences. Thus, the invention embraces degenerate nucleic acids that differ from the 
biologically isolated nucleic acids in codon sequence due to the degeneracy of the genetic code. 

The invention also provides isolated. unique fragments of SEQ ID NO:l, SEQ ID NO:3, 
or compliments of SEQ ID NO: 1 or SEQ ID NO:3. A unique fragment is one that is a 

15 'signature* for the larger nucleic acid. It, for example, is long enough to assure that its precise 
sequence is not found in molecules outside of the osteoclast transporter protein family as defined 
by claim 1 . Unique fragments can be used as probes in Southern blot assays to identify family 
members or can be used in amplification assays such as those employing PCR. As known to 
those skilled in the art, large probes such as 200 nucleotides or more are preferred for certain 

20 uses such as Southern blots, while smaller fragments will be preferred for uses such as PCR. . 
Unique fragments also can be used to produce fusion proteins for generating antibodies or for 
generating immxmoassay components. Likewise, unique fragments can be employed to produce 
fragments of the osteoclast transporter protein such as only the extracellular portion, useful, for 
example, in inmnmoassays or as a competitive inhibitor of the substrate of the osteoclast 

25 transporter protein in therapeutic applications. Unique fragments further can be used as antisense 
molecules to inhibit the expression of the osteoclast transporter proteins of the invention, 
particularly for therapeutic purposes as described in greater detail below. 

As will be recognized by those skilled in the art, the size of the unique fragment will 
depend upon its conservancy in the genetic code. Thus, some regions of SEQ ID NO:l and SEQ 

30 ID NO:3, and their complements, will require longer segments to be unique while others will 
require only short segments, typically between 12 and 32 nucleotides (e.g. 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 and 32 bases long). Virtually any segment of 
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SEQ ID NO:l or SEQ ID NO;3, or their complements, that is 1 8 or more nucleotides in length 
will be unique. Those skilled in the art are well versed in methods for selecting such sequences, 
typically on the basis of the ability of the unique fragment to selectively distinguish the sequence 
of interest from non-family members. A comparison of the sequence of the fragment to those on 
5 known data bases typically is all that is necessary, although in vitro confirmatory hybridization 
and sequencing analysis may be performed. 

As mentioned above, the invention embraces antisense oligonucleotides that selectively 
bind to a nucleic acid molecule encoding an osteoclast transporter protein, to decrease osteoclast 
activity. This is desirable in virtually any medical condition wherein a reduction in osteoclast 

10 activity is desirable, including when a reduction in bone loss or an increase in bone mass is 
desired. By decreasing the osteoclast activity in a subject, bone remodeling thus can be 
favorably affected. Antisense molecules, in this manner, can be used to slow down or arrest the 
loss in bone mass occurring with certain forms of osteoporosis and may even result in an 
increase in bone mass in circumstances where an increase in bone mass is desirable. 

15 As used herein, the term "antisense oligonucleotide" or "antisense" describes an 

oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes under physiological 
conditions to DNA comprising a particular gene or to an mRN A transcript of that gene and, 
thereby, inhibits the transcription of that gene and/or the translation of that mRNA. The 

20 antisense molecules are designed so as to interfere with transcription or translation of a target 
gene upon hybridization with the target gene. Those skilled in the art will recognize that the 
exact length of the antisense oligonucleotide and its degree of complementarity with its target 
will depend upon the specific target selected, including the sequence of the target and the 
particular bases which comprise that sequence. It is preferred that the antisense oligonucleotide 

25 be constracted and arranged so as to bind selectively with the target under physiological 

conditions, i.e., to hybridize substantially more to the target sequence than to any other sequence 
in the target cell xmder physiological conditions. Based upon SEQ ID NO:l and SEQ ID NO:3, 
or upon allelic or homologous genomic and/or cDNA sequences, one of skill in the art can easily 
choose and synthesize any of a number of appropriate antisense molecules for use in accordance 

30 with the present invention. In order to be sufficiently selective and potent for inhibition, such 
antisense oligonucleotides should comprise at least 10 and, more preferably, at least 15 
consecutive bases which are complementary to the target. Most preferably, the antisense 
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oligonucleotides comprise a complementary sequence of 20-30 bases. Although 
oligonucleotides may be chosen which are antisense to any region of the gene or mRNA 
transcripts, in preferred embodiments the antisense oligonucleotides correspond to N-terminal or 
5* upstream sites such as translation initiation, transcription initiation or promoter sites. In 
5 addition, 3 '-untranslated regions or telomerase sites may be targeted. Targeting to mRNA 

splicing sites has also been used in the art but may be less preferred if alternative mRNA splicing 
occurs. In addition, the antisense is targeted, preferably, to sites in which mRNA secondary 
structure is not expected (see, e.g., Sainio et al.. Cell Mol. Neurobiol. 14(5):439-457 (1994)) and 
at which proteins are not expected to bind. Finally, although, SEQ ID NOs:l and 3 disclose 

10 cDNA sequences, one of ordinary skill in the art may easily derive the genomic DNAs 

corresponding to the cDNAs of SEQ ID NOs:l and 3. Thus, the present invention also provides 
for antisense oligonucleotides which are complementary to the genomic DNAs corresponding to 
SEQ ID NOs:l and 3. Similarly, antisense to allelic or homologous cDNAs and genomic DNAs 
are enabled without undue experimentation. 

15 In one set of embodiments, the antisense oligonucleotides of the invention may be 

composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof. That 
is, the 5* end of one native nucleotide and the 3* end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester intemucleoside linkage. These 
oligonucleotides may be prepared by art recognized methods which may be carried out manually 

20 or by an automated sj nthesizer. 

In preferred embodiments, however, the antisense oligonucleotides of the invention also 
may include "modified" oligonucleotides. That is, the oligonucleotides may be modified in a 
number of ways which do not prevent them from hybridizing to their target but which enhance 
their stability or targeting or which otherwise enhance their therapeutic effectiveness. 

25 The term "modified oligonucleotide" as used herein describes an oligonucleotide in 

which (1) at least two of its nucleotides are covalently linked via a synthetic intemucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5* end of one nucleotide 
and the 3* end of anotlier nucleotide) and/or (2) a chemical group not normally associated with 
nucleic acids has been covalently attached to the oligonucleotide. Preferred synthetic 

30 intemucleoside linkages include phosphorothioates, alkylphosphonates, phosphorodithioates, 
phosphate esters, alkylphosphonothioates, phosphoramidates, carbamates, carbonates, phosphate 
triesters, acetamidates. and carboxymethyl esters. 



3NSDOCID: <WO__9742321A1_IB> 



wo 97/42321 PCT/US97/07856 

- 10- 

The term "modified oligonucleotide" also encompasses oligonucleotides with a 
covalently modified base and/or sugar. For example, modified oligonucleotides include 
oligonucleotides having backbone sugars which are covalently attached to low molecular weight 
organic groups other than a hydroxy 1 group at the 3' position and other than a phosphate group at 
5 the 5* position. Thus modified oligonucleotides may include a 2'-0-alkylated ribose group. In 
addition, modified oligonucleotides may include sugars such as arabinose instead of ribose. The 
present invention, thus, contemplates pharmaceutical preparations containing modified antisense 
molecules that are complementary to and hybridizable with, under physiological conditions, 
nucleic acids encoding osteoclast transporter proteins, together with pharmaceutically acceptable 
1 0 carriers. 

Antisense oligonucleotides may be administered as part of a pharmaceutical composition 
or medicament. Such a pharmaceutical composition or medicament may include the antisense 
oligonucleotides in combination with any standard physiologically and/or pharmaceutically 
acceptable carriers which are known in the art. The compositions should be sterile and contain a 

15 therapeutically effective amount of the antisense oligonucleotides in a unit of weight or volume 
suitable for administration to a patient. The term "pharmaceutically acceptable" means a non- 
toxic material that does not interfere with the effectiveness of the biological activity of the active 
ingredients. The term "physiologically acceptable" refers to a non-toxic material that is 
compatible with a biological system such as a cell, cell culture, tissue, or organism. The 

20 characteristics of the carrier will depend on the route of administration. Physiologically and 
pharmaceutically acceptable carriers include diluents, fillers, salts, buffers, stabilizers, 
solubilizers, and other materials which are well known in the art. 

The invention also involves expression vectors coding for osteoclast transporter proteins 
and fi*agments and variants thereof and host cells containing those expression vectors. Virtually 

25 any cells, prokaryotic or eukaryotic, which can be transformed with heterologous DNA or RNA 
and which can be grown or maintained in culture, may be used in the practice of the invention. 
Examples include bacterial cells such as E. coli and mammalian cells such as mouse, hamster, 
pig> goat, primate, etc. They may be of a wide variety of tissue types, including mast cells, 
fibroblasts, oocytes and lymphocytes, and they may be primary cells or cell lines. Specific 

30 examples include CHO cells and COS cells. Cell-free transcription systems also may be used in 
lieu of cells. In gene therapy applications, human hematopoietic cells that are precursors of 
osteoclasts are contemplated. 
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As used herein, a "vector" may be any of a number of nucleic acids into which a desired 
sequence may be inserted by restriction and ligation for transport between different genetic 
environments or for expression in a host cell. Vectors are typically composed of DNA although 
RNA vectors are also available. Vectors include, but are not limited to, plasmids and phagemids. 
A cloning vector is one which is able to replicate in a host cell, and which is further characterized 
by one or more endonuclease restriction sites at which the vector may be cut in a determinable 
fashion and into which a desired DNA sequence may be ligated such that the new recombinant 
vector retains its ability to replicate in the host cell. In the case of plasmids, replication of the 
desired sequence may occur many times as the plasmid increases in copy number within the host 
bacteriimi or just a single time per host before the host reproduces by mitosis. In the case of 
phage, replication may occur actively during a lytic phase or passively during a lysogenic phase. 
An expression vector is one into which a desired DNA sequence may be inserted by restriction 
and ligation such that it is operably joined to regulatory sequences and may be expressed as an 
RNA transcript. Vectors may further contain one or more marker sequences suitable for use in 
the identification of cells which have or have not been transformed or transfected with the vector. 
Markers include, for example, genes encoding proteins which increase or decrease either 
resistance or sensitivity to antibiotics or other compounds, genes which encode enzymes whose 
activities are detectable by standard assays known in the art (e.g. B-galactosidase or alkaline 
phosphatase), and genes which visibly affect the phenotype of transformed or transfected cells, 
hots, colonies or plaques. Preferred vectors are those capable of autonomous replication and 
expression of the structural gene products present in the DNA segments to which they are 
operably joined. 

As used herein, a coding sequence and regulatory sequences are said to be "operably" 
joined when they are covalently linked in such a way as to place the expression or transcription 
of the coding sequence under the influence or control of the regulatory sequences. If it is desired 
that the coding sequences be translated into a functional protein, two DNA sequences are said to 
be operably joined if induction of a promoter in the 5* regulatory sequences results in the 
transcription of the coding sequence and if the nature of the linkage between the two DNA 
sequences does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the 
ability of the promoter region to direct the transcription of the coding sequences, or (3) interfere 
with the ability of the corresponding RNA transcript to be translated into a protein. Thus, a 
promoter region would be operably joined to a coding sequence if the promoter region were 
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capable of effecting transcription of that DNA sequence such that the resulting transcript might 
be translated into the desired protein or polypeptide. 

The precise nature of the regulatory sequences needed for gene expression may vary 
between species or cell types, but shall in general include, as necessary, 5* non-transcribing and 
5 5' non-translating sequences involved with the initiation of transcription and translation 

respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. Especially, 
such 5* non-transcribing regulatory sequences will include a promoter region which includes a 
promoter sequence for transcriptional control of the operably joined gene. Regulatory sequences 
may also include enhancer sequences or upstream activator sequences as desired. The vectors of 

10 the invention may optionally include 5' leader or signal sequences, 5' or 3'. The choice and design 
of an appropriate vector is within the ability and discretion of one of ordinary skill in the art. 

Expression vectors containing all the necessary elements for expression are commercially 
available and known to those skilled in the art. See Sanbrook et ah. Molecular Cloning: A 
Laboratorv Manual . Second Edition, Cold Spring Harbor Laboratory Press, 1989. Cells are 

15 genetically engineered by the introduction into the cells of heterologous DNA (RNA) encoding 
the osteoclast transporter protein or fragment or variant thereof. That heterologous DNA (RNA) 
is placed imder operable control of transcriptional elements to permit the expression of the 
heterologous DNA in the host cell. In still another aspect of the invention, a defective osteoclast 
or precursor thereof is treated with DNA in a manner to promote via homologous recombination 

20 intracellularly the correction of a defective osteoclast transporter gene. 

Preferred systems for niRNA expression in mammalian cells are those such as pRc/CMV 
(available from Invitrogen, San Diego, CA) that contain a selectable marker such as a gene that 
confers G418 resistance (which facilitates the selection of stably transfected cell lines) and the 
human cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, suitable for 

25 expression in primate or canine cell lines is the pCEP4 vector (Invitrogen), which contains an 
Epstein Ban* virus (EBV) origin of replication, facilitating the maintenance of plasmid as a 
multicopy extrachromosomal element. 

mRNA expression and transporter function can be tested using these vectors in a wide 
variety of mammalian cell lines. Preferred systems include cells derived from kidney tubules, 

30 including MDCK and IMCD-3 cells (both available from ATCC). 

A variety of systems for expression of proteins in bacterial, yeast, mammalian, or insect 
cells have been described and are commercially available. Preferred systems include the 
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Glutathione-S-transferase (GST) Gene Fusion system available from Phamiacia Biotech, 
Piscataway, NJ. In this system a plasmid is constructed containing the protein sequence of 
interest (in this case, the transporter including the first extracellular domain) inserted in frame 
downstream of the 25 kDa GST domain from S. japonicum . Expression of the fusion protein can 
5 be induced in transfected bacterial cells and the fusion protein purified by affinity 

chromatography using Glutathione Sepharose 4B. Cleavage of the desired peptide from the GST 
sequences is achieved using a site specific protease whose recognition sequence is located 
inmiediately upstream from the cloning site. An alternative system which is desirable since it 
maintains eucaryotic-specific functions such as glycosylation is recombination into baculovirus. 
10 Standard protocols exist (c.f. O'Reilly et al., Baculovirus Expression Vectors: A :Laboratory 
Manual, IRL/Oxford University Press, 1992) and vectors, cells, and reagents are commercially 
available. 

The invention also involves polypeptides which bind to osteoclast transporter proteins 
and in certain embodiments preferably to the extracellular domain of the proteins. Such binding 

1 5 partners can be used in screening assays to detect the presence or absence of the osteoclast 

transporter protein and in purification protocols to isolate osteoclast transporter proteins. Such 
binding partners also can be used to inhibit the native activity of the osteoclast transporter protein 
by binding to the extracellular domain of such proteins. Likewise, such binding partners can be 
used to selectively target drugs, toxins or other molecules to osteoclasts. In this manner, 

20 osteoclast activity may be selectively increased by drugs that would enhance osteoclast activity 
or selectively impaired by drugs that would inhibit osteoclast activity (e.g. toxins, antisense, 
etc.). In this manner, bone remodeling may be desirably affected. 

The invention, therefore, involves antibodies or fragments of antibodies having the ability 
to selectively bind to osteoclast transporter proteins, and preferably to the extracellular domain 

25 thereof. Antibodies include polyclonal and monoclonal antibodies, prepared according to 

conventional methodology. Antibodies were raised according to standard procedures* A fusion 
protein was prepared using the pGEX-4T-2 vector (Pharmacia, Piscataway, NJ). The fusion 
protein contains a glutathione-S -transferase (GST) fragment joined to the terminal 84 amino 
acids of the osteoclast transporter. The fusion protein was expressed and injected into rabbits to 

30 produce antibodies as rabbit immune serum. The antibodies so generated detected a protein of 
molecular weight which corresponds to the osteoclast transporter in tissue prepared from normal 
mouse kidneys. The protein is significantly reduced or absent in kidney tissue prepared from 
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oc/oc mutant mice, consistent with the absence of detectable osteoclast transporter mRNA by 
Northern blot in these mice. 

Significantly, as is well-known in the art, only a small portion of an antibody molecule, 
the paratope, is involved in the binding of the antibody to its epitope (see, in general, Clark, 
5 W.R. (1986) The Experimental Foundations of Modem Immunology Wiley & Sons, Inc., New 
York; Roitt, I. (1991) Essential Immunology . 7th Ed., Blackwell Scientific Publications, 
Oxford). The pFc* and Fc regions, for example, are effectors of the complement cascade but arc 
not involved in antigen binding. An antibody from which the pFc' region has been enzymatically 
cleaved, or which has been produced without the pFc* region, designated an F(ab')2 fragment, 

10 retains both of the antigen binding sites of an intact antibody. Similarly, an antibody from which 
the Fc region has been enzymatically cleaved, or which has been produced without the Fc region, 
designated an Fab fragment, retains one of the antigen binding sites of an intact antibody 
molecule. Proceeding further. Fab fragments consist of a covalently bpund antibody light chain 
and a portion of the antibody heavy chain denoted Fd. The Fd fragments are the major 

15 determinant of antibody specificity (a single Fd fragment may be associated with up to ten 
different light chains without altering antibody specificity) and Fd fragments retain epitope- 
binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there are 
complementarity determining regions (CDRs), which directly interact with the epitope of the 

20 antigen, and framework regions (FRs), which maintain the tertiary structure of the paratope (see, 
in general, Clark, 1986; Roitt, 1991). In both the heavy chain Fd fragment and the light chain of 
IgG immunoglobulins, there are four framework regions (FRl through FR4) separated 
respectively by three complementarity determining regions (CDRl through CDR3). The CDRs, 
and in particular the CDR3 regions, and more particularly the heavy chain CDR3, are largely 

25 responsible for antibody specificity. 

It is now well-established in the art that the non-CDR regions of a mammalian antibody 
may be replaced with similar regions of conspecific or heterospecific antibodies while retaining 
the epitopic specificity of the original antibody. This is most clearly manifested in the 
development and use of "humanized" antibodies in which non-human CDRs are covalently 

30 joined to human FR and/or Fc/pFc* regions to produce a functional antibody. Thus, for example, 
PCT International Publication Number WO 92/04381 teaches the production and use of 
hxmianized murine RSV antibodies in which at least a portion of the murine FR regions have 
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been replaced by FR regions of human origin. Such antibodies, including fragments of intact 
antibodies with antigen-binding ability, are often referred to as "chimeric" antibodies. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 
provides for F(ab*)2, Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or FR 
5 and/or CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous 
human or non-human sequences; chimeric F(ab')2 fragment antibodies in which the FR and/or 
CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous human 
or non-human sequences; chimeric Fab fragment antibodies in which the FR and/or CDRl and/or 
CDR2 and/or light chain CDR3 regions have been replaced by homologous human or non- 
10 human sequences; and chimeric Fd fragment antibodies in which the FR and/or CDRl and/or 
CDR2 regions have been replaced by homologous human or non-human sequences. The present 
invention also includes so-called single chain antibodies. Thus, the invention involves 
polypeptides, of numerous size and type that bind specifically to osteoclast transporter proteins. 
These polypeptides may be derived also from sources other than antibody technology. For 
15 example, such polypeptide binding agents can be provided by degenerate peptide libraries which 
can be readily prepared in solution, in immobilized form or as phage display libraries. 
Combinatorial libraries also can be synthesized of peptides containing one or more amino acids. 
Libraries further can be synthesized of peptoids and non-peptide synthetic moieties. 

Phage display can be particularly effective in identifying binding peptides usefiil 
20 according to the invention. Briefly, one prepares a phage library (using e.g. ml 3, fd, or lambda 
phage), displaying inserts from 4 to about 80 amino acid residues using conventional procedures. 
The inserts may represent a completely degenerate or biased array. One then can select phage- 
bearing inserts which bind to the osteoclast transporter protein. This process can be repeated 
through several cycles of reselection of phage that bind to the osteoclast transporter protein. 
25 Repeated rounds lead to enrichment of phage bearing particular sequences. DNA sequence 

analysis can be conducted to identify the sequences of the expressed polypeptides. The minimal 
linear portion of the sequence that binds to the osteoclast transporter protein can be determined. 
One can repeat the procedure using a biased library containing inserts containing part or all of the 
minimal linear portion plus one or more additional degenerate residues upstream or downstream 
30 thereof. Thus, the osteoclast transporter molecule of the invention, an extracellular domain 
thereof, or the like, can be used to screen peptide libraries, including phage display libraries, to 
identify and select peptide binding partners of the extracellular portion of the osteoclast 
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transporter protein of the invention. Such molecules can be used, as described, for screening 
assays, for diagnostic assays, for purification protocols, for interfering directly with the 
functioning of osteoclasts by binding to the osteoclast transporter protein or for targeting drugs, 
toxins and/or labeling agents (e.g. radioisotopes, fluorescent molecules, etc.) to osteoclasts 
5 and/or osteoclast transporter proteins. Drug molecules that would affect osteoclast activity and 
toxin molecules that would disable or destroy osteoclast cells are known to those skilled in the 
art and are commercially available. For example, the immunotoxin art provides examples of 
toxins which are effective when delivered to a cell by an antibody or fragment thereof. 
Examples of toxins include ribosome-damaging toxins derived from plants or bacterial such as 

1 0 ricin, abrin, saporin, Pseudomonas endotoxin, diphtheria toxin, A chain toxins, blocked ricin, etc. 

Osteoclast activity can be assayed both in vitro and in vivo. Heterogeneous culture 
systems in which bone marrow derived cells are co-cultured with osteoblast cells in the presence 
of 1,25-dihydroxyvitamin D3 will generate mature osteoclasts, whose activity can be quantitated 
by measuring their ability to form resorption pits on dentine slices (Takahashi et al., 

15 Endocrinology^ Vol. 122, No. 4, 1988). In mammals, decreased osteoclast function results in the 
clinical condition of osteopetrosis. In rodents, this can be readily scored by virtue of the fact 
that, in this case, incisors fair to erupt after birth. Thus, rodent neonates can be treated with 
compositions of the invention and the effect on osteoclast activity can be determined by scoring 
incisor eruption. The degree of osteopetrosis can be more accurately quantitated by 

20 morphometrical analysis of the bone marrow space (Boyce et aL, J. Clin, Invest.^ Vol. 90, 1992). 

When used therapeutically, the compounds of the invention are administered in 
therapeutically effective amounts. In general, a therapeutically effective amount means that 
amount necessary to delay the onset of, inhibit the progression of, or halt altogether the particular 
condition being treated. Therapeutically effective amounts specifically will be those which 

25 desirably influence osteoclast activity. When it is desired to decrease osteoclast activity, then 
any inhibition of osteoclast activity is regarded as a therapeutically effective amount. When it is 
desired to increase osteoclast activity, then any enhancement of osteoclast activity is regarded as 
a therapeutically effective amount. Generally, a therapeutically effective amount will vary with 
the subject's age, condition, and sex, as well as the nature and extent of the disease in the subject, 

30 all of which can be determined by one of ordinary skill in the art. The dosage may be adjusted 
by the individual physician or veterinarian, particularly in the event of any complication. A 
therapeutically effective amount typically varies from 0.01 mg/kg to about 1000 mg/kg. 
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preferably from about 0. 1 mg/kg to about 200 mg/kg and most preferably from about 0.2 mg//kg 
to about 20 mg/kg, in one or more dose administrations daily, for one or more days. 

The therapeutics of the invention can be administered by any conventional route, 
including injection or by gradual infiision over time. The administration may, for example, be 
5 oral, intravenous, intraperitoneal, intramuscular, intracavity, subcutaneous, or transdermal. 
When antibodies are used therapeutically, a preferred route of administration is by pulmonary 
aerosol. Techniques for preparing aerosol delivery systems containing antibodies are well 
known to those of skill in the art. Generally, such systems should utilize components which will 
not significantly impair the biological properties of the antibodies, such as the paratope binding 

10 capacity (see, for example, Sciarra and Cutie, "Aerosols," in Remington's Pharmaceutical 

Sciences , 18th edition, 1990, pp 1694-1712; incorporated by reference). Those of skill in the art 
can readily determine the various parameters and conditions for producing antibody aerosols 
without resort to undue experimentation. When using antisense preparations of the invention, 
slow intravenous administration is preferred. 

15 Preparations for parenteral administration include sterile aqueous or non-aqueous 

solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl, 
oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, 
including saline and buffered media. Parenteral vehicles include sodium chloride solution, 

20 Ringer*s dextrose, dextrose and sodium chloride, lactated Ringer's or fixed oils. Intravenous 
vehicles include fluid and nutrient replenishers, electrolyte replenishers (such as those based on 
Ringer's dextrose), and the like. Preservatives and other additives may also be present such as, 
for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like. 

The invention also contemplates gene therapy. The procedure for performing ex vivo 

25 gene therapy is outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history of 
that patent, all of which are publicly available docimients. In general, it involves introduction in 
vitro of a ftinctional copy of a gene into a cell(s) of a subject which contains a defective copy of 
the gene, and returning the genetically engineered cell(s) to the subject. The functional copy of 
the gene is under operable control of regulatory elements which permit expression of the gene in 

30 the genetically engineered cell(s). Niunerous transfection and transduction techniques as well as 
appropriate expression vectors are well known to those of ordinary skill in the art, some of which 
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are described in PCX application WO95/00654. In vivo gene therapy using vectors such as 
adenovirus also is contemplated according to the invention. 

Thus, defective osteoclasts or precursors thereof are provided with a non-defective 
nucleic acid encoding an osteoclast transporter protein. In particular, such gene therapy is 
5 appropriate for hereditary forms of osteopetrosis involving defective osteoclast transporter 

protein genes. For example, primary himian blood cells which are precursors of osteoclasts can 
be obtained from the bone marrow of a subject who is a candidate for such gene therapy. 
Candidates can be identified by screening for abnormal osteoclast function that results from a 
defective osteoclast transporter protein. Then, such cells c£ui be genetically engineered ex vivo 

10 with DNA (RNA) encoding a normal osteoclast transporter protein. The genetically engineered 
cells then are returned to the patient. 

That the transporter can be used to correct the defect in the clinical conditions of 
osteopetrosis in which it has been inactivated is demonstrated using a mouse model system. The 
osteosclerosis (oc) mutant mouse fails to express the transporter and has osteopetrosis. The 

15 transporter is introduced into a vector such as pXTl (Strategene, La JoUa, CA) that, when 
transfected into an appropriate packaging cell line, will generate a higher titer of replication- 
defective retrovitral particles carrying the transporter under the regulatory control of a thymidine 
kinase promoter. Hematopoietic progenitor cells from oc mutant mice are prepared and, using 
well-documented techniques of infection and selection, stem cells carrying this vector are 

20 selected after coculture with the retrovirus. Re-population of lethally irradiated oc mutant mice 
with this transfected population is accomplished using documented protocols of hematopoietic 
transplantation, and the degree of amelioration of the osteopetrosis phenotype then is determined 
by radiography and morphometrical analysis. 

While the invention has been described with respect to certain embodiments, it should be 

25 appreciated that many modifications and changes may be made by those of ordinary skill in the 
art without departing from the spirit of the invention. It is intended that such modification, 
changes and equivalents fall within the scope of the following claims. 
A Sequence Listing is provided, followed by what is claimed: 
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SBQUENCE LISTING 



(1) GENERAL INFORMATION: 



5 (i) APPLICANT: 

(A) NAME: BRIGHAM AND WOMEN'S HOSPITAL, INC 

(B) STREETI: 75 FRANCIS STREETI 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

10 (E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 02115 



(i) APPLICANT/ INVENTOR: 

(A) NAME: BEIER, DAVID R, 

15 (B) STREET: 39 ARLINGTON ROAD 

(C) CITY: BROOKLINE 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 02146 

20 

(i) APPLICANT/INVENTOR: 

(A) NAME: BRADY, KEVIN P. 

(B) STREET: 3N BENNET COURT 

(C) CITY: BOSTON 

25 (D) STATE: MASSACHUSETTS 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 02113 



(ii) TITLE OF INVENTION: OSTEOCLAST TRANSPORTER 



(iii) NUMBER OF SEQUENCES: 7 
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(iv) CDRRESPOMDENCE ADDRESS: 

(A) ADDRESSEE: WOLF, GREENFIEIiD & SACKS, P.C. 

(B) STREET: 600 ATLANTIC AVENUE 

(C) CITY: BOSTON 

(D) STATE: MASSACEIUSEITS 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 02210 



(v) COMPUTER READABLE FORM: 
10 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC cornpatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 



15 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUM 

(B) FILING DATE: 

(C) CLASSIFICATION; 



20 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/647,397 , 

(B) FILING DATE: 09-MAY-1996 

(viii) ATTORNEY/AGENT INFORMATION: 
25 (A) NAME: Gates, Edward R. 

(B) REGISTRATION NUMBER: 31,616 

(C) REFERENCE/DOCKET NUMBER: B0801/7048WO 

(ix) TELECOMMUNICATION INFORMATION: 
30 (A) TELEPHONE: 617-720-3500 

(B) TELEFAX: 617-720-2441 
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(2) INFORMATION FOR SBQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

10 

(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus ntusculus 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
20 (B) LOCATION: 120.. 1733 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO:l: 

TGGGAGOC?rG CIACTACAGC AGCTGCTGAA CCTAGACAGG CACGGCAACT GCTGCATCCA 60 

25 

GCTCCAGCCC AACTGAATCC AGCTCCAACC ACCACTTTTG GTrCATCTTG CCTOCTGCC 119 

ATG ACC TTC TCC GAG ATT CTG GAC CGT GTT GGA AGC ATG GGC CCC TTC 167 
Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser Met Gly Pro Phe 
30 1 5 10 15 

CAG TAC CTG CAT GTG ACC TTG CTG GCC CTC CCA ATC CTC GGA ATA GCC 215 
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Gin Tyr Leu His Val Thr Leu Leu Ala Leu Pro He Leu Gly He Ala 

20 25 30 



AAC CAC AAC TTG CIA CAG ATC TTC ACA GCC ACC ACC CCT GAC CAC CRC 263 
5 Asn His Asn Leu Leu Gin He Phe Thr Ala Thr Thr Pro Asp His His 

35 40 45 



TGT CGC CCX3 C!CC CCC AAC GCC TCT CIA GAG CCC TGG GTA CTC CCC TIG 311 
Cys Arg Pro Pro Pro Asn Ala Ser Leu Glu Pro Trp Val Leu Pro Leu 
10 50 55 60 



GGC CCA AAC GGG AAG CCT GAG AAG TGT CTC CGC TTC GTC CAT CTG CCA 359 
Gly Pro Asn Gly Lys Pro Glu Lys Cys Leu Arg Phe Val His Leu Pro 
65 70 75 80 

AAC GCC AGT CIT CCC AAT GAC ACC CAG GGG GCC ACC GAG CCA TCC TIG 407 
Asn Ala Ser Leu Pro Asn Asp Thr Gin Gly Ala Thr Glu Pro Cys Leu 

85 90 95 

20 GAT GGC TOG ATC TAC AAC AGC ACC AGA GAC ACC ATT GTC ACA GAG IGG 455 
Asp Gly Trp He Tyr Asn Ser Thr Arg Asp Thr He Val Thr Glu Trp 

100 105 110 



GAC TTG GTA TGC GGC TOC AAC AAA CTG AAG GAG ATC GCA CAG TCA GTC 503 
25 Asp Leu Val Cys Gly Ser Asn Lys Leu Lys Glu Met Ala Gin Ser Val 

115 120 125 

TTC ATC GCA GGT ATA CIG GTT GGA GGA CCT GTC TTT GGA GAA CIG TCA 551 
Phe Met Ala Gly He Leu Val Gly Gly Pro Val Phe Gly Glu Leu Ser 
30 130 135 140 



GAC AGG TIT GGC CGC AAG CCC ATC CTG ACC TCG AGC TAT CIC TIG CIG 599 
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Asp Arg Phe Gly TUrg Lys Pro lie Ijeu Thr Trp Ser Tyr Leu Leu Leu 

145 150 155 160 



GCA GCC ACT GGC TCC AGT GCT GCC TTC AGC CCC AGC CTC ACT GTC TAT 647 
5 Ala Ala Ser Gly Ser Ser Ala Ala Phe Ser Pro Ser Leu Thr Val Tyr 

165 170 175 



ATC ATC TTC CGA TTC CTG TGT GGC TGC AGC ATC TOG GGC ATT TCT CTG 695 
Met lie Phe Arg Phe Leu Cys Gly Cys Ser lie Ser Gly lie Ser Leu 
10 180 185 190 



AGC ACC ATT ATC TIG AAT GTG GAA TGG CTA CCC ACC TCC AOG CGG GCC 743 
Ser Thr lie lie Leu Asn Val Glu Trp Val Pro Thr Ser Thr Arg T^a 
195 200 205 



ATC TCA TCA ACA ACT ATT GGG TAC TGC TAC ACC ATT GCT CAA TTC ATT 791 
lie Ser Ser Thr Thr lie Gly Tyr Cys Tyr Thr lie Gly Gin Phe lie 
210 215 220 



20 CTG CCr GGC CTG GCC TAT GCC GTT COT CAG TGG CGC TGG CTA CAG TTG 839 
Leu Pro Gly Leu Ala Tyr Ala Val Pro Gin Trp Arg Trp Leu Gin Leu 
225 230 235 240 

TCC GTG TCT GCT GCC TTC TTC ATC TTC TCC TTG TTG TCC TOG TGG GTA 887 
25 Ser Val Ser Ala Ala Phe Phe lie Phe Ser Leu Leu Ser Trp Trp Val 

245 250 255 

CCA GAG TCC ATA CGC TGG CTG GTT CTG TCT GGA AAA TTC TCA CGA GCT ^ 935 
Pro Glu Ser lie Arg Trp Leu Val Leu Ser Gly Lys Phe Ser Arg Ala 
30 2 6 0 265 2 70 



CTG AAG ACA CTC CAA OGT GTG GCT ACC TTC AAC GGC AAG AAG GAG GAA 983 
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Leu Lys Thr Leu Gin Arg Val Ala Thr Phe Asn Gly Lys Lys Glu Glu 
275 280 285 



GGG GAA AAG CTC ACT GTG GAG GAG CTG AAG TTC AAC TIG CAG AAG GAC 1031 
5 Gly Glu Lys Leu Thr Val Glu Glu Leu Lys Phe Asn Leu Gin Lys Asp 
290 295 300 



ATC ACC TGA GCC AAG GTC AAA TAT GGC TTA TCT GAC TTG TTC CJGA CTG 1079 
lie Thr Ser Ala Lys Val Lys Tyr Gly Leu Ser Asp Leu Phe Arg Val 
10 305 310 315 320 



TCC ATC CTG CGC OGT GTG ACC TTC TGT CTC TCT CTG GCC TGG TIT GCT 1127 
Ser lie Leu Arg Arg Val Thr Phe Cys Leu Ser Leu Ala Trp Phe Ala 

325 330 335 



ACT GGC TTT GCC TAC TAC AGT TTG GCT ATG GGA GTA GAA GAA TTT GGA 1175 
Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val Glu Glu Phe Gly 

340 345 350 



20 . GTC AAC ATC TAC ATA CTC CAG ATC ATC TTC GCT GGG GTT GAC ATT CCC 1223 
Val Asn lie Tyr lie Leu Gin lie lie Phe Gly Gly Val Asp lie Pro 
355 360 365 



GCC AAG TTC ATC ACA ATC CTC TCC ATA AGT TAT CTG GGC OGG CGC ATC 1271 
25 Ala Lys Phe lie Thr lie Leu Ser lie Ser Tyr Leu Gly Arg Arg lie 
370 375 380 



ACT CAG GGC TTC CTC CTG ATC CTG GCA GGA GTG GCC ATC CTG GCC CTC 1319 
Thr Gin Gly Phe Leu Leu lie Leu Ala Gly Val Ala lie Leu Ala Leu 
30 385 390 395 400 



ATC TIT GTG TCT TCA GAA ATG CAG CTC TTG AGA ACA GGA CTG GCT CTA 1367 
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lie Phe Val Ser Ser Glu Met Gin Leu Leu Arg Thr Ala Leu Ala Veil 

405 410 415 



TIT GGG AAG GGA TGC CTG TCP GGC TCC TTC AGC TGC CTC TTC CTC TAG 1415 
5 Phe Gly Lys Gly Cys Leu Ser Gly Ser Phe Ser Cys Leu Phe Leu Tyr 

420 425 430 



ACA AGT GAG CTC TAG CCT ACA GTC CTC AGG CAA ACA GGT ATG GGT ATC 1463 
Thr Ser Glu Leu Tyr Pro Thr Val Leu Arg Gin Thr Gly Met Gly lie 
10 435 440 445 



AGT AAC ATA TGG C-CT CGA GTG GGA AGT ATG ATA GCC CCA CTG GTG AAA 1511 
Ser Asn lie Tirp Ala Arg Val Gly Ser Met lie Ala Pro Leu Val Lys 
450 455 460 



ATC ACG GGA GAA CTG CAG CCC TTC ATC CCT AAT GTC ATC TTT TOG ACC 1559 
lie Thr Gly Glu Leu Gin Pro Phe lie Pro Asn Val lie Phe Trp Thr 
465 470 475 480 



20 ATG ACT CTA CTG GGA GGC AGT GCT GCC TTC TIT CTG CTT GAG ACC CTC 1607 
Met Thr Leu Leu Gly Gly Ser Ala Ala Phe Phe Leu Leu Glu Thr Leu 

485 490 495 



AAT CGG CCC TTA CCA GAA ACT ATC GAG GAC ATA CAA GAC TGG TAC CAG 1655 
25 Asn Arg Pro Leu Pro Glu Thr lie Glu Asp lie Gin Asp Trp Tyr Gin 

500 505 510 



CAA ACC AAG AAA ACA AAG CAG GAG CCA GAA GCA GAA AAG GCA TCC CAG 1703 
Gin Thr Lys Lys Thr Lys Gin Glu Pro Glu Ala Glu Lys Ala Ser Gin 
30 515 520 525 



ACA ATC CCG CTG AAG ACT GGT GGA CCC TAGCTAAGAA CAACAGAATC 1750 
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Thr lie Pro Leu Lys Thr Gly Gly Pro 
530 535 



CTcrrrccTG ccitcx::agag actgatccca agcagggccc ttccaaggct attcgagcac isio 



CTTAGGGGTr GGGTQGAGCX: CCAGCTGGCT CCMXSCrcrC AGAACAAAGA CTTCTGAGAG 1870 



TTCAGCAAAA GTGTTITACX: TTCACCACCT CCACCX3TAGC CCACAACXXA GACX7IGGCCT 1930 



10 GTTCACAGCC CTAGCCATAC TCACTCCTGC ACTCATCCTC CXITGCAACXX: AGGCCCTOCX: 1990 



A'i'iTi'i'Crcr ACCCTCTTTG TATTGGCCAT TTCCTCCATr GTCCCACCTC CATTTCCCIT 2050 



TGAGATTCCC TGGCAGTTCT AATGGTTTCC TCTTACCCTC CCCCTCGTGC CG 2102 



(2) INFORMATION FOR SEQ ID NO:2: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) IiENGIH: 537 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser Met Gly Pro Phe 
1 5 10 15 



Gin Tyr Leu His Val Thr Leu Leu Ala Leu Pro lie Leu Gly lie Ala 

20 25 30 
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Asn His Asn Leu Leu Gin lie Phe Thr Ala Thr Thr Pro Asp His His 

35 40 45 



Cys Arg Pro Pro Pro Asn Ala Ser Leu Glu Pro Trp Val Leu Pro Leu 
50 55 60 



Gly Pro Asn Gly Lys Pro Glu Lys Cys Leu Arg Phe Val His Leu Pro 
65 70 75 80 



10 Asn Ala Ser Leu Pro Asn Asp Thr Gin Gly Ala Thr Glu Pro Cys Leu 

85 90 95 



Asp Gly Trp lie Tyr Asn Ser Thr Arg Asp Thr lie Val Thr Glu Trp 

100 105 110 

15 

Asp Leu Val Cys Gly Ser Asn Lys Leu Lys Glu Met Ala Gin Ser Val 
115 120 125 



Phe Met Ala Gly lie Leu Val Gly Gly Pro Val Phe Gly Glu Leu Ser 
20 130 135 140 



Asp Arg Phe Gly Arg Lys Pro lie Leu Thr Trp Ser Tyr Leu Leu Leu 
145 150 155 160 



25 Ala Ala Ser Gly Ser Ser Ala Ala Phe Ser Pro Ser Leu Thr Val Tyr 

165 170 175 



Met lie Phe Arg Phe Leu Cys Gly Cys Ser lie Ser Gly lie Ser Leu 

180 185 190 

30 

Ser Thr lie lie Leu Asn Val Glu Trp Val Pro Thr Ser Thr Arg Ala 
195 200 205 
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Ile Ser Ser Thr Thr lie Gly Tyr Cys Tyr Thr lie Gly Gin Phe lie 

210 215 220 



Leu Pro Gly Leu Ala Tyr Ala Val Pro Gin Trp Arg Trp Leu Gin Leu 



5 225 



230 



235 



240 



Ser Val Ser Ala Ala Phe Phe lie Phe Ser Leu Leu Ser Trp Trp Val 



245 



250 



255 



10 Pro Glu Ser lie Arg Trp Leu Val Leu Ser Gly Lys Phe Ser Arg Ala 



260 



265 



270 



Leu Lys Thr Leu Gin Arg Val Ala Thr Phe Asn Gly Lys Lys Glu Glu 
275 280 285 



15 



Gly Glu Lys Leu Thr Val Glu Glu Leu Lys Phe Asn Leu Gin Lys Asp 



290 



295 



300 



lie Thr Ser Ala Lys Val Lys Tyr Gly Leu Ser Asp Leu Phe Arg Val 
20 305 310 315 3 20 



Ser lie Leu Arg Arg Val Thr Phe Cys Leu Ser Leu Ala Trp Phe Ala 



325 



330 



335 



25 Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val Glu Glu Phe Gly 



340 



345 



350 



Val T^n lie Tyr lie Leu Gin lie lie Phe Gly Gly Val Asp lie Pro 



355 



360 



365 



30 



Ala Lys Phe lie Thr lie Leu Ser lie Ser Tyr Leu Gly Arg Arg lie 



370 



375 



380 
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Thr Gin Gly Phe Leu Leu lie Leu Ala Gly Val Ala lie Leu Ala Leu 
385 390 395 400 

lie Phe Val Ser Ser Glu Met Gin Leu Leu Arg Thr Ala Leu Ala Val 
5 405 410 415 

Phe Gly Lys Gly Cys Leu Ser Gly Ser Phe Ser Cys Leu Phe Leu Tyr 

420 425 430 

10 Thr Ser Glu Leu Tyr Pro Thr Val Leu Arg Gin Thr Gly Met Gly lie 

435 440 445 



Ser Asn lie Trp Ala Arg Val Gly Ser Met lie Ala Pro Leu Val Lys 
450 455 460 

15 

lie Thr Gly Glu Leu Gin Pro Phe lie Pro Asn Val lie Phe Trp Thr 
465 470 475 480 



Met Thr Leu Leu Gly Gly Ser Ala Ala Phe Phe Leu Leu Glu Thr Leu 
20 485 490 495 



Asn Arg Pro Leu Pro Glu Thr lie Glu Asp lie Gin Asp Trp Tyr Gin 

500 505 510 



25 Gin Thr Lys Lys Thr Lys Gin Glu Pro Glu Ala Glu Lys Ala Ser Gin. 

515 520 525 



Thr lie Pro Leu Lys Thr Gly Gly Pro 



530 



535 



30 
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(i) SEQUENCE CHARACIERISTICS: 

(A) LENGTH: 2135 base pairs 

(B) TYPE: nucleic acid 

(C) STIU^EDNESS: double 

(D) TOPOIiOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 73.. 1758 



20 (xi) SEQUENCE DESCRIPTION: SEQ It) NO: 3: 



GAATTCCGGT GATCACCAGC CCCATCGGAT CCAGACCOGG CCACCAGCTC TOGCTCGTCT 60 



TGCCCCAGTG CC ATG ACC TTC TOG GAG ATC CFG GAC OCT GTC GGA AGC 108 
25 Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser 

15 10 



ATG QGC CAT TTC CAG TTC CTG CAT GTA GCC ATA CTG GGC CTC CCG ATC 156 
Met Gly His Phe Gin Phe Leu His Val Ala lie Leu Gly Leu Pro lie 
30 15 20 25 



ore AAC ATG GCC AAC CAC AAC CTG CTG CAG ATC TTC ACA GCC GCC ACC 204 
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Leu Asn Met Ala Asn His Asn Leu Leu Gin lie Phe Thr Ala Ala Thr 
30 35 40 



CCr GTC CAC CAC TGT CGC OOG CCC CAC AAT GCC TCC ACA GGG CXJT TGG 252 
5 Pro Val His His Cys Arg Pro Pro His Asn Ala Ser Thr Gly Pro Trp 
45 50 55 60 



GIG CrC CCC ATG GGC CCA AAT GGG AAG CCT GAG AGG TGC CTC CGT TTT 300 
Val Leu Pro Met Gly Pro Asn Gly Lys Pro Glu Arg Cys Leu Arg Phe 
10 65 70 75 



GTA CAT CCG CCC AAT GCC AGC CTG CCC AAT GAC ACC CAG AGG GCC ATG 348 
Val His Pro Pro Asn Ala Ser Leu Pro Asn Asp Thr Gin Arg Ala Met 

80 85 90 - 



GAG CCA TGC CTG GAT GGC TGG GTC TAC AAC AGC ACC AAG GAC TCC ATT 396 
Glu Pro Cys Leu Asp Gly Trp Val Tyr Asn Ser Thr Lys Asp Ser lie 

95 100 105 



20 GTG ACA GAG TGG GAC TTG GTG TGC AAC TCC AAC AAA CTG AAG GAG ATG 444 
Val Thr Glu Trp Asp Leu Val Cys Asn Ser Asn Lys Leu Lys Glu Met 
110 115 120 



GCC CAG TCP ATC TTC ATG GCA GGI ATA CTG ATT GGA GGG CTC GTG CTT 492 
25 Ala Gin Ser lie Phe Met Ala Gly lie Leu lie Gly Gly Leu Val Leu 
125 130 135 140 



GGA GAC CIG TCT GAC AGG TTT GGC CGC AGG CCC ATC CTG ACC TGC AGC 540 
Gly Asp Leu Ser Asp Arg Phe Gly Arg Arg Pro lie Leu Thr Cys Ser 
30 145 150 155 



TAC CIG CTG CTG GCA GCC AGC GGC TCC GGT GCA GCC TIC AGC CCC ACC 588 
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Tyr Leu Leu Leu Ala Ala Ser Gly Ser Gly Ala Ala Phe Ser Pro Thr 

160 165 170 



TTC CCC ATC TAG ATG CTC TTC CGC TTC CTG TCT GGG TTT GGC ATC TCA 
5 Phe Pro lie Tyr Met Val Phe Arg Phe Leu Cys Gly Phe Gly lie Ser 

175 180 185 



GGC ATT ACC CTG AGC ACC GTC ATC TTG AAT GTG GAA TGG GTG CCT ACC 
Gly lie Thr Leu Ser Thr Val lie Leu Asn Val Glu Trp Val Pro Thr 
10 190 195 20*0 



COG ATG GGG GCC ATC ATG TCG ACA GCA CTC GGG TAG TGC TAG ACC TTT 
Arg Met Arg Ala lie Met Ser Thr Ala Leu Gly Tyr Cys Tyr Thr Phe 
205 210 215 220 



GGC CAG TTC ATT CTG CCC GGC CTG GCC TAG GCC ATC CCC GAG IGG GGT 
Gly Gin Phe lie Leu Pro Gly Leu Ala Tyr Ala lie Pro Gin Trp Arg 

225 230 235 



20 TGG CTG CAG TTA ACT GTG TCC ATT CCC TTC TTC GTC TTC TTC OTA TCA 
Trp Leu Gin Leu Thr Val Ser lie Pro Phe Phe Val Phe Phe Leu Ser 

240 245 250 



TCC TGG TGG ACA CCA GAG TCC ATA CGC TGG TTG GTC TTG TOT GGA AAG 
25 Ser Trp Trp Thr Pro Glu Ser lie Arg Trp Leu Val Leu Ser Gly Lys 

255 260 265 



TCC TCG AAG GCC CTG AAG ATA CTC CGG CGG GTT GGC TGT CTT CAA TOG 
Ser Ser Lys Ala Leu Lys lie Leu Arg Arg Val Gly Cys Leu Gin Trp 
30 270 275 280 



CAA GAA GGA AGA AGG AGA AAG CTC AGC TTG GAA GAG CTC AAA CTC AAC 
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Gln Glu Gly Arg Arg Arg Lys Leu Ser Leu Glu Glu Leu Lys Leu Asn 
285 290 295 300 



CTG GAG AAG GAG ATC TCC TIG GCC AAG GCC AAT TAG ACC GCA AGT GAG 1020 
5 Leu Gin Lys Glu lie Ser Leu Ala Lys Ala Asn Tyr Thr Ala Ser Asp 

305 310 315 



CTG TTC CGG ATA CCC ATG CTG CGC CGC ATG ACC TTC TCT CTT TCC CTG 1068 
Leu Phe Arg lie Pro Met Leu Arg Arg Met Thr Phe Cys Leu Ser Leu 
10 320 325 330 



GCC TCG TTT GCT ACC GOT TIT GCC TAG TAT AGT TTG GCT ATG GGT GTG 1116 
Ala Trp Phe Ala Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val 
335 340 345 



GAA GAA TTT GGA GTC AAC CTC TAG ATC CTG GAG ATC ATC TTT GGT GGG 1164 
Glu Glu Phe Gly Val Asn Leu Tyr lie Leu Gin lie lie Phe Gly Gly 
350 355 360 



20 GTC GAT GTC CCA GCC AAG TTC ATC ACC ATC CTC TCC TTA AGC TAG CTG 1212 
Val Asp Val Pro Ala Lys Phe lie Thr lie Leu Ser Leu Ser Tyr Leu 
365 370 375 380 



GGC CGG CAT ACC ACT GAG GCC GCT GCC CTG CTC CTG GCA GGA GGG GCC 1260 
25 Gly Arg His Thr Thr Gin Ala Ala Ala Leu Leu Leu Ala Gly Gly Ala 

385 390 395 



ATC TIG GCT CTC ACC TIT CTG CCC TIG GAC TTG CAG ACC CTG AGG ACA 1308 
lie Leu Ala Leu Thr Phe Val Pro Leu Asp Leu Gin Thr Val Arg Thr 
30 400 405 410 



CTA TTG GCT GTG TIT GGG AAG GGA TGC CTA TCC AGC TCC TTC AGC TGC 1356 
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Val Leu Ala Val Phe Gly Lys Gly Cys Leu Ser Ser Ser Phe Ser Cys 



415 



420 



425 



crc TTC crc tag aca agt gaa tta tag gcg aca gtg atc agg caa aca 

5 Leu Phe Leu Tyr Thr Ser Glu Leu Tyr Pro Thr Val lie Arg Gin Thr 



430 



435 



440 



1404 



GGT ATG GGC GTA AGT AAC CTG TGG ACC OGC GTG GGA AGC ATC GIG TCC 
Gly Met Gly Val Ser Asn Leu Trp Thr Arg Val Gly Ser Met Val Ser 



10 445 



450 



455 



460 



1452 



15 



COG CTG GTG AAA ATC ACG GGT GAG GTA CAG CCC TTC ATC CCC AAT ATC 
Pro Leu Val Lys lie Thr Gly Glu Val Gin Pro Phe lie Pro Asn lie 



465 



470 



475 



1500 



ATC TAG GGG ATC ACC GCC CTC CTC GGG GGC AGT GCT GCC CTC TIC CTG 
lie Tyr Gly lie Thr Ala Leu Leu Gly Gly Ser Ala Ala Leu Phe Leu 



480 



485 



490 



1548 



20 CCr GAG ACC CIG AAT CAG CCC TTG CCA GAG ACT ATC GAA GAC CIG GAA 
Pro Glu Thr Leu Asn Gin Pro Leu Pro Glu Thr lie Glu Asp Leu Glu 



495 



500 



505 



1596 



T^C TGG TCC CTG COG GCA AAG AAG CCA AAG CAG GAG CCA GAG GTG GAA 

r 

25 Asn Trp Ser Leu Arg Ala Lys Lys Pro Lys Gin Glu Pro Glu Val Glu 



510 



515 



520 



1644 



AAG GCC TCC CAG AGG ATC CTG TAG AGC CTC ACG GAC CAG GCC TGG GCT 
Lys Ala Ser Gin Arg lie Leu Tyr Ser Leu Thr Asp Gin Ala Trp Ala 



30 525 



530 



535 



540 



1692 



CCA GCT GAG GAC AAC GGA ACC CCC TTT CCC TGC CCT CCA GAG ACT GAT 



1740 
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Pro Ala Glu Asp Asn Gly Thr Pro Phe Pro Cys Pro Pro Glu Thr Asp 

545 550 555 



CCT AGC CAG GCA CCT TAGGAGTATA GGGAGGCCXX: ATATAGGTCC ATCCTCCTAG 1795 
5 Pro Ser Gin Ala Pro 

560 



GATGAAGCCT TCTGAGAGCT TOGrGAAGGT GTCKXATCA CCACCACCAG AGCCTCXTTGC 1855 



10 CCAGCCCTGG CCAGTTCAAA GGTTCAGCCA TCCCTGCCCT TGTTCTCCCT GCAACCCAGG 1915 



CCCTGCCATT CTTCrGTCTA GCCCTTCCCC ACTGGCCACC TTCCCCCACT GTCCXX3GTCC 1975 



TCTTCCCCro AGGTOwCCTG ATATCCCC3X5 GCTCAGTCCT AACAAGACTG AGTCTIAACA 2035 



AGATGAGAAG TCCTCaXJIT CITGCCTCCC ACACITTTCr TTGATGGGAG GTTTCAATAA 2095 



ACAGCGATAA GAACTCTAAA AAAAAAAAAA ACCXSGAATTC 2135 



(2) INFORMATION FOR SEQ ID NO:4: 



(i) SEQUENCE GHARACTERISTICS : 

(A) IjENGTH: 561 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



(ii) MOIiECUIiE TYPE: protein 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 



Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser Met Gly His Phe 
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15 10 15 



Gin Phe Leu His Val Ala lie Leu Gly Leu Pro lie Leu Asn Met Ala 

20 25 30 



Asri His Asn Leu Leu Gin lie Phe Thr Ala Ala Thr Pro Val His His 

35 40 45 



Cys Arg Pro Pro His Asn Ala Ser Thr Gly Pro Trp Val Leu Pro Met 
10 50 55 60 



Gly Pro Asn Gly Lys Pro Glu Arg C?ys Leu Arg Phe Val His Pro Pro 
65 70 75 80 



15 Asn Ala Ser Leu Pro Asn Asp Thr Gin Arg Ala Met Glu Pro cys Leu 

85 90 95 



Asp Gly Trp Val Tyr Asn Ser Thr Lys Asp Ser lie Val Thr Glu Trp 

100 105 110 

20 

Asp Leu Val Cys Asn Ser Asn Lys Leu Lys Glu Met Ala Gin Ser lie 

115 120 125 



Phe Met Ala Gly lie Leu lie Gly Gly Leu Val Leu Gly Asp Leu Ser 
25 130 135 140 



Asp Arg Phe Gly Arg Arg Pro lie Leu Thr Cys Ser Tyr Leu Leu Leu 
145 150 155 160 



30 Ala Ala Ser Gly Ser Gly Ala Ala Phe Ser Pro Thr Phe Pro lie Tyr 

165 170 175 
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Met Val Phe Arg Phe Leu Cys Gly Phe Gly lie Ser Gly lie Thr Leu 

180 185 190 



Ser Thr Val lie Leu Asn Val Glu Trp Val Pro Thr Arg Met Arg Ala 
195 200 205 



lie Met Ser Thr Ala Leu Gly Tyr Cys Tyr Thr Phe Gly Gin Phe lie 
210 215 220 



10 Leu Pro Gly Leu Ala Tyr Ala lie Pro Gin Trp Arg Trp Leu Gin Leu 
225 230 235 240 



Thr Val Ser lie Pro Phe Phe Val Phe Phe Leu Ser Ser Trp Trp Thr 

245 250 255 

15 

Pro Glu Ser lie Arg Trp Leu Val Leu Ser Gly Lys Ser Ser Lys Ala 

260 265 270 



Leu Lys lie Leu Arg Arg Val Gly Cys Leu Gin Trp Gin Glu Gly Arg 
20 275 280 285 



Arg Arg Lys Leu Ser Leu Glu Glu Leu Lys Leu Asn Leu Gin Lys Glu 
290 295 300 



25 lie Ser Leu Ala Lys Ala. Asn Tyr Thr Ala Ser Asp Leu Phe Arg lie 
305 310 315 320 



Pro Met Leu Arg Arg Met Thr Phe Cys Leu Ser Leu Ala Trp Phe Ala 

325 330 335 

30 

Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val Glu Glu Phe Gly 

340 345 350 
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Val Asn Leu Tyr lie 
355 

Ma Lys Phe lie Thr 
5 370 

Thr Gin Ala Ala Ala 
385 

10 Thr Phe Val Pro Leu 

405 

Phe Gly Lys Gly Cys 

420 

15 

Thr Ser Glu Leu Tyr 
435 

Ser Asn Leu Trp Thr 
20 450 

He Thr Gly Glu Val 
465 

25 Thr Ala Leu Leu Gly 

485 

Asn Gin Pro Leu Pro 

500 

30 

Arg Ala Lys Lys Pro 
515 



-38- 

Leu Gin He He Phe Gly 
360 

lie Leu Ser Leu Ser Tyr 
375 

Leu Leu Leu Ala Gly Gly 
390 395 

Asp Leu Gin Thr Val Arg 

410 

Leu Ser Ser Ser Phe Ser 

425 

Pro Thr Val He Arg Gin 
440 

Arg Val Gly Ser Met Val 
455 

Gin Pro Phe He Pro Asn 
470 475 

Gly Ser Ala Ala Leu Phe 

490 

Glu Thr He Glu Asp Leu 

505 

Lys Gin Glu Pro Glu Val 
520 



PCT/US97/07856 

Gly Val Asp Val Pro 
365 

Leu Gly Arg His Thr 
380 

Ala He Leu Ala Leu 

400 

Thr Val Leu Ala Val 

415 

Cys Leu Phe Leu Tyr 
430 

Thr Gly Met Gly Val 
445 

Ser Pro Leu Val Lys 
460 

He He Tyr Gly He 

480 

Leu Pro Glu Thr Leu 

495 

Glu Asn Trp Ser Leu 
510 

Glu Lys Ala Ser Gin 
525 
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Arg lie Ijeu Tyr Ser Leu Thr Asp Gin Ala Trp Ala Pro Ala Glu Asp 
530 535 540 



Asn Gly Thr Pro Phe Pro Cys Pro Pro Glu Thr Asp Pro Ser Gin Ala 
5 545 550 555 560 



Pro 



10 

(2) INFORMATION FOR SBQ ID NO: 5 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



20 



(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 



25 (v) FRAGMENT TYPE: internal 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Asp Arg Phe Gly Aig Lys 
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(2) INFORMATION FOR SBQ ID N0:6: 



5 (i) SEQUENCE CHARACTERISTICS: 

(A) IiENGIH: 5 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

10 

(ii) MOIiECUIiE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
15 (iv) ANTI-SENSE: NO 



(v) FRAGMENT TYPE: internal 



(ix) FEATURE: 
20 (A) NAME/KEY: Region 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "Xaa at position 2 is Arg or 
Lys" 



25 (ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "Xaa at position 5 is Arg or 
Lys" 

30 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 6: 
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Asp Xaa Xaa 7^ Xaa 



(2) INFORMATION FOR SBQ ID NO:7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
10 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



15 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



20 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 7 



AAGTTCATCA CCATCCTCTC CirAAGCTAC CTGGGCOGGC ATACCACTCA QGCOGCTGCC 



25 TGCrCCTGGC AGGAGGGGCC ATCITQGCTC TCACCTTITG CCCTTOGACT TGCAGACOCT 



GAGACACTAT TOGdGTGTT TGGGAAGGGA TGCCTATCCA GCTCCTTCAG CTGCCTCITC 



CTCTACACAA GTCAATTATA CCCCACAGTC ATCAGGCAAA CAGGTATGGG OGTAAGTAAC 

30 

CIXJCGGACCC GCGTGGGAAG CATGGTGTCC CGCTGGTGAA AATCAGGGCT GAGGTACAGC 



3NSDOCID: <WO 9742321A1JB> 



wo 97/42321 

^2 PCTAJS97/07856 

CCITCATCCC CAATATCATC TACGGGAT 

328 
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CLAIMS 

1 . An isolated nucleic acid molecule 

(a) which hybridizes under stringent conditions to a molecule consisting of the nucleic 
5 acid sequence of SEQ ID NO:l or SEQ ID NO:7 and which codes for an osteoclast transporter 

molecule, 

(b) nucleic acid molecules that differ from the nucleic acid molecules of (a) in codon 
sequence due to the degeneracy of the genetic code, and 

(c) complements of (a) and (b). 

10 

2. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid molecule 
hybridizes under stringent conditions to a molecule consisting of the nucleic acid sequence of 
SEQ IDNO:3. 

15 3. The isolated nucleic acid molecule of claim 1 , wherein the isolated nucleic acid molecule 
consists essentially of SEQ ID NO: 3. 

4. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid molecule 
consists essentially of SEQ ID NO: 1 . 

20 

5. An isolated nucleic acid molecule selected from the group consisting of (a) a unique 
fragment of SEQ ID NOrl between 12 and 2101 nucleotides in length, (b) a unique fragment of 
nucleotides 1-2112 of SEQ ID NO:3 between 12 and 21 1 1 nucleotides in length, (c) 
complements of "(a)" and (d) complements of "(b)". 

25 

6. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is a unique fragment of SEQ ID NO:l between 12 and 1974 nucleotides in length. 

7. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
30 is a unique fragment of nucleotides 120-1733 of SEQ ID NO:l between 12 and 1612 nucleotides 

in length. 
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8. The isolated nucleic acid molecule of claim 5. wherein the isolated nucleic acid molecule 
is selected from the group consisting of a unique fragment of nucleotides 1-1 179 of SEQ ID 
NO:3 between 12 and 1 178 nucleotides in length and a unique fragment of nucleotides 1512- 
2112 of SEQ ID NO:3 between 12 and 599 nucleotides in length. 

9. The isolated nucleic acid molecule of claim 5. wherein the isolated nucleic acid molecule 
is a unique fragment of nucleotides 73-1758 of SEQ ID NO:3 between 12 and 1684 nucleotides 
in length. 



10 10. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is selected from the group consisting of a unique fragment of nucleotides 73- 1 1 79 of SEQ ID 
NO: 3 between 12 and 1 105 nucleotides in length and a unique fragment of nucleotides 1512- 
1758 of SEQ ID NO:3 between 12 and 245 nucleotides in length, 

15 11. The isolated nucleic acid molecule of any of claims 5-10, wherein the isolated nucleic 
acid molecule is selected from the group consisting of a unique fragment of at least 14 
contiguous nucleotides, a unique fragment of at least 15 contiguous nucleotides, a unique 
fragment of at least 16 contiguous nucleotides, a unique fragment of at least 17 contiguous 
nucleotides, a unique fragment of at least 18 contiguous nucleotides, a unique fragment of at 

20 least 20 contiguous nucleotides and a unique fragment of at least 22 contiguous nucleotides. 

12. The isolated nucleic acid molecule of any of claims 5- 1 0, wherein the isolated nucleic 
acid molecule consists of between 12 and 32 contiguous nucleotides. 

25 13. A host cell transformed or transfected with an expression vector comprising the isolated 
nucleic acid molecule of any of claims 1, 2, 3 or 4 operably linked to a promoter. 

1 4. An isolated polypeptide coded for by the isolated nucleic acid molecule of any of claims 
1.2, 3 or 4. 



30 



1 5. An isolated polypeptide selectively binding a protein coded for by the isolated nucleic 
acid molecule of any of claims 1, 2, 3 or 4 . 
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16. The isolated polypeptide of claim 15, wherein the isolated polypeptide is an Fab or F(ab) 
fragment of an antibody. 

17. The isolated polypeptide of claim 15 wherein the isolated polypeptide is a fragment of an 
5 antibody, the fragment including a CDR3 region selective for the protein. 

1 8. The isolated polypeptide of claim 1 5, wherein the isolated polypeptide is a monoclonal 
antibody. 

10 19. The isolated polypeptide of claim 15, 16 or 17, wherein the isolated polypeptide 
selectively binds an extracellular portion of the protein. 

20. A method for decreasing osteoclast activity in a subject comprising 

administering to a subject in need of such treatment an agent that selectively binds to an 
15 isolated nucleic acid molecule of claim 1 or an expression product thereof, in an amount effective 
to decrease osteoclast activity in said subject. 

21 . A method as claimed in claim 20, wherein the agent is a modified nucleic acid. 

20 22. A method as claimed in claim 20, wherein the agent is a polypeptide. 

23. The use of an agent that selectively binds to an isolated nucleic acid molecule of claim 1, 
or an expression product thereof, in the preparation of a medicament. 

25 24. The use as claimed in claim 23, wherein the agent is a modified nucleic acid. 

25. The use as claimed in claim 23, wherein the agent is a polypeptide. 
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